REGULATION OF GENE EXPRESSION IN TOBACCO FOR MANIPULATION OF
PLANT GROWTH AND SECONDARY METABOLISM 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application is a continuation-in-part of US Patent Application Ser. No. 60/132,919, filed
May 6, 1999, now abandoned, which is hereby incorporated by reference in its entirety herein.

FIELD OF THE INVENTION 
This invention relates to enzymes involved in alkaloid, and specifically nicotine, formation in
tobacco plants. The invention is based, at least in part, on the nucleotide sequences encoding four
variants of putrescine N-methyltransferase (PMT1, PMT2, PMT3, and PMT4), two variants of
arginine decarboxylase (ADC1 and ADC2), ornithine decarboxylase (ODC), S-adenosylmethionine
synthetase (SAMS), a fragment of NADH dehydrogenase, and a fragment of
phosphoribosylanthranilate isomerase. The invention also relates to proteins expressed by these
nucleotides, promoter regions of these nucleotides, use of these promoter regions to culture
transgenic plant cells and to produce transgenic plants, sense and antisense nucleotides
complementary to all or portions of these nucleotide sequences, use of sense and antisense
nucleotides to regulate gene expression, and assays using proteins involved in alkaloid formation in
tobacco plants.

BACKGROUND OF THE INVENTION 

I. Alkaloid Formation 

Alkaloids are one of the most diverse groups of secondary compounds found in plants and
they are the product of a complex biosynthetic pathway (Hashimoto and Yamada, 1994; Chou and
Kutchan, 1998; Waterman, 1998). Why plants accumulate these compounds and in so many
different forms is not known. Moreover, for many alkaloids, the exact site of synthesis and the
factors that control their intercellular distribution and accumulation remain to be determined
(Hashimoto and Yamada, 1994; Kutchan, 1995; Chou and Kutchan, 1998).

Nicotine is the most abundant alkaloid present in cultivated tobacco. Nicotine is formed
primarily in the roots of the tobacco plant and subsequently is transported to the leaves, where it is
stored (Tso, Physiology and Biochemistry of Tobacco Plants, pp. 233-34, Dowden, Hutchinson &
Ross, Stroudsburg, Pa. (1972)).

The synthesis and accumulation of nicotine and other tobacco alkaloids are known to be
controlled by various developmental, environmental, and chemical cues. Changes in phytohormone
(e.g., auxin, cytokinin) levels and/or ratios as a consequence of developmental age (Hashimoto and
Yamada, 1994; Kutchan, 1995) or by direct manipulation of plant cell culture conditions have been
shown to affect the synthesis and accumulation of nicotine and various tobacco alkaloids (Hashimoto
and Yamada, 1994; Hibi et al., 1994; Eilbert, 1998). Various abiotic factors (wounding, drought
stress, pH imbalance, etc.) (Hashimoto and Yamada, 1994; Kutchan, 1998; Waterman, 1998),
as well as biotic factors, such as herbivory, insect feeding, and attack by various microbial and fungal
pathogens, are known to elicit increased production of nicotine and other alkaloids in the leaves of wild
and cultivated tobacco species (Baldwin, 1989; Saito and Murakoishi, 1998; Baldwin and Prestin,
1999). In addition, the commercial practice of topping (i.e., removal of the flowering head and young
leaves at the upper portions of the plant) results in increases in nicotine and in the amount and
complexity of total alkaloids present in the leaves of Nicotiana tabacum (Hashimoto and Yamada,
1994; Hibi et al., 1994). The factors controlling the topping-induced increase in alkaloid
biosynthesis are not known, but likely involve a complex physiological response in the plant as a
result of altered phytohormones and wound-induced signaling (Akehurst, 1981; Hibi et al., 1994;
Kutchan, 1998). In this regard, considerable evidence now exists indicating that a jasmonic acid
(JA)-mediated signal transduction pathway may play a role in regulation of gene expression
contributing to this increase in alkaloid biosynthesis (Baldwin et al., 1994, 1996, 1997; Ohnmeiss et
al., 1997; Imanishi et al., 1998a, 1998b).

The nicotine molecule is composed of two heterocyclic rings, a pyridine moiety and a
pyrrolidine moiety, each of which is derived from a separate biochemical pathway. The pyridine
moiety of nicotine is derived from nicotinic acid. The pyrrolidine moiety of nicotine is provided
through a pathway leading from putrescine to N-methylputrescine and then to N-methylpyrroline
(Goodwin and Mercer, Introduction to Plant Biochemistry, pp. 488-91, Pergamon Press, New York
(1983)).

Putrescine is formed in plants by one of two pathways (Chattopadhyay and Ghosh, 1998). It
can be synthesized directly from ornithine, in a reaction catalyzed by the enzyme ornithine
decarboxylase (ODC, EC 4.1.1.17), or formed indirectly from arginine in a reaction sequence
initiated by arginine decarboxylase (ADC, EC 4.1.1.19). Putrescine formed by the ADC and/or ODC
pathway serves as precursor in the synthesis of the higher polyamines, spermine and spermidine,
catalyzed by the enzymes spermine synthase and spermidine synthase, respectively, or it is converted
to N-methylputrescine by the action of putrescine N-methyltransferase (PMT), the first committed
step in nicotine biosynthesis (Hashimoto and Yamada, 1994; Kutchan, 1995; Chattopadhyay and
Ghosh, 1998). N-methylputrescine is oxidized by a diamine oxidase and cyclized to form the
1-methyl-Δ1-pyrrolinium cation, which is condensed with nicotinic acid or its derivative to form nicotine
(Hashimoto and Yamada, 1994).

Putrescine is a precursor for N-methylputrescine, which then forms N-methylpyrroline.
Conversion of putrescine to N-methylputrescine is catalyzed by the enzyme putrescine
N-methyltransferase ("PMT"), with S-adenosylmethionine serving as the methyl group donor. PMT
appears to be the rate-limiting enzyme in the pathway supplying N-methylpyrroline for nicotine
synthesis in tobacco (Feth et al., "Regulation in Tobacco Callus of Enzyme Activities of the Nicotine
Pathway", Planta, 168, pp. 402-07 (1986); Wagner et al., "The Regulation of Enzyme Activities of
the Nicotine Pathway in Tobacco", Physiol. Plant., 68, pp. 667-72 (1986)).
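
The pathway recited above can be summarized as a small directed graph. The following is only an
illustrative sketch: the use of Python, the data structure, and the names PATHWAY and route are
assumptions made for illustration and are not part of the disclosure. The reactions and enzymes
are those named in the text; the multi-step route from arginine is collapsed into a single edge
labeled with its initiating enzyme, and the higher polyamines are grouped as one product.

```python
# Illustrative sketch only: substrate -> list of (product, enzyme or step) edges,
# transcribed from the pathway description in the text above.
PATHWAY = {
    "ornithine": [("putrescine", "ODC")],
    # The arginine route is a reaction sequence; only its initiating enzyme is named.
    "arginine": [("putrescine", "ADC")],
    "putrescine": [
        ("N-methylputrescine", "PMT"),
        ("higher polyamines", "spermidine synthase / spermine synthase"),
    ],
    "N-methylputrescine": [("1-methyl-pyrrolinium cation", "diamine oxidase")],
    "1-methyl-pyrrolinium cation": [
        ("nicotine", "condensation with nicotinic acid or its derivative")
    ],
}


def route(start, goal, pathway=PATHWAY):
    """Return the (substrate, enzyme, product) steps leading from start to goal.

    Uses a depth-first search over the acyclic graph above and returns None
    when the goal cannot be reached from the starting compound.
    """
    if start == goal:
        return []
    for product, enzyme in pathway.get(start, []):
        rest = route(product, goal, pathway)
        if rest is not None:
            return [(start, enzyme, product)] + rest
    return None


if __name__ == "__main__":
    for substrate, enzyme, product in route("ornithine", "nicotine"):
        print(f"{substrate} --[{enzyme}]--> {product}")
```

Listing the route from ornithine or arginine makes it visible that both converge on putrescine and
that PMT is the first step unique to the nicotine branch.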

II. Transgenic Plants

The methods of nicotine formation in tobacco and the genes involved have been studied both 
to better understand differential gene expression during tobacco growth and development, and also to 
discover tools useful for creating transgenic plants. For example, the regulatory sequences that 
modify protein expression in tobacco may be useful in creating transgenic tobacco or other 
transgenic plants. 

It has already been demonstrated that tissues of many plant species may be transformed by
exogenous, typically chimeric, genes which are effective to stably transform cells of the tissues. For
several species, tissues transformed in this fashion may be regenerated to give rise to whole
transgenic or genetically engineered plants. The engineered traits introduced into the transgenic
plants by these techniques have proven to be stable and have also proven to be transmissible through
normal Mendelian inheritance to the progeny of the regenerated plants. One such desirable trait is
the production of desired gene products in vivo in the cells of the transgenic plants.
For a chimeric gene to be effective, the foreign DNA sequence containing a coding region should be
flanked by appropriate promoter and control regions. Commonly used plant cell transcription
promoters include the nopaline synthase promoter from the T-DNA of A. tumefaciens and the 35S
promoter from the cauliflower mosaic virus.

In order for the newly inserted chimeric gene to express the protein for which it codes in the
plant cell, the proper regulatory signals must be present and in the proper location with respect to the
gene. These regulatory signals include a promoter region, a 5' non-translated leader sequence and a 3'
polyadenylation sequence. A promoter is a DNA sequence that directs the cellular machinery of a
plant to produce RNA from the contiguous structural coding sequence downstream (3') to the
promoter. The promoter region influences the rate at which the RNA product of the gene and
resultant protein product of the gene are made. The 3' polyadenylation signal is a non-translated region
that functions in the plant cells to cause the addition of polyadenylate nucleotides to the 3' end of
the RNA to enable the mRNA to be transported to the cytoplasm and to stabilize the mRNA for
subsequent translation of the RNA to produce protein.
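
The required 5'-to-3' order of these elements can be made concrete with a short sketch. It is
offered only as an illustration under stated assumptions: the class name ChimericGene, its field
names, and the placeholder sequences are hypothetical and do not correspond to any sequence
disclosed herein.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ChimericGene:
    """Hypothetical container holding the elements in 5'-to-3' order."""

    promoter: str                # directs transcription of the downstream coding region
    leader_5prime: str           # 5' non-translated leader sequence
    coding_region: str           # structural coding sequence
    polyadenylation_3prime: str  # 3' non-translated polyadenylation sequence

    def elements(self):
        """Return (name, sequence) pairs in their required 5'-to-3' order."""
        return [
            ("promoter", self.promoter),
            ("leader_5prime", self.leader_5prime),
            ("coding_region", self.coding_region),
            ("polyadenylation_3prime", self.polyadenylation_3prime),
        ]

    def assemble(self) -> str:
        """Concatenate the elements into one contiguous construct."""
        return "".join(seq for _, seq in self.elements())

    def element_spans(self):
        """Map each element name to its (start, end) position in the construct."""
        spans, position = {}, 0
        for name, seq in self.elements():
            spans[name] = (position, position + len(seq))
            position += len(seq)
        return spans


if __name__ == "__main__":
    # Placeholder sequences for illustration only.
    gene = ChimericGene("TATAAATA", "GGAACC", "ATGGCCTAA", "AATAAA")
    print(gene.assemble())
    print(gene.element_spans())
```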

Other plant cell transformation techniques are directed toward the direct insertion of DNA
into the cytoplasm of plant cells from which it is taken up, by an uncharacterized mechanism, into
the genome of the plant. One such technique is electroporation, in
which electric shock causes disruption of the cellular membranes of individual plant cells. Plant
protoplasts in aqueous solution, when subjected to electroporation, will take up DNA from the
surrounding medium. Another technique involves the physical acceleration of DNA, coated onto
small inert particles, either into regenerable plant tissues or into plant germline cells.

The availability of cloned nucleic acid sequences encoding an enzyme involved in alkaloid 
synthesis allows for the potential manipulation of alkaloid contents. Furthermore, the availability of
promoters useful for expressing genes in plants allows for the creation of chimeric molecules and 
transgenic plants, which in turn result in possible native plant production of desirable proteins. 

Previously reported work discloses cloning nucleotides encoding proteins involved in the
biosynthesis of nicotine, and isolating such proteins. Approximately twenty or more cDNAs and/or
genomic DNA fragments encoding different enzymes involved with alkaloid formation have been
isolated (Chattopadhyay and Ghosh, 1998). For example, successful cloning of partial or full-length
cDNA encoding ODC activity from tobacco was disclosed by Malik et al. (J. Plant Biochem. &
Biotech., 5, pp. 109-112 (1996)). Also, a relatively crude preparation of PMT (30-fold purification) has
been subjected to limited characterization (Mizusaki et al., "Phytochemical Studies on Tobacco
Alkaloids XIV. The Occurrence and Properties of Putrescine N-Methyltransferase in Tobacco
Plants", Plant Cell Physiol., 12, pp. 633-40 (1971)). A process for purifying PMT is disclosed in US
Patent No. 5,369,023, "Method of purifying putrescine N-methyltransferase from tobacco plant
extract with an anion exchange medium", hereby incorporated by reference in its entirety herein.
Several laboratories have reported the cloning of partial or full-length cDNAs encoding ADC (Bell
and Malmberg, 1990; Rostogi et al., 1993; Perez-Amador et al., 1995; Nam et al., 1997; Watson and
Malmberg, 1996). Comparisons of the amino acid sequences of ADC from various plants revealed a
high degree of conservation among the various proteins, as well as homology to ODC (Malmberg et
al., 1998).

It is an object of the present invention to characterize the nucleotide and amino acid
sequences of enzymes involved in the biosynthesis of nicotine in tobacco. It is also an object of the
present invention to provide plant promoter regions that are capable of conferring high levels of
transcription in rapidly dividing cells of transformed plants when coupled with a heterologous coding
sequence in a chimeric gene. Further, the invention is directed to chimeric genes incorporating such
promoter regions, stable transfection of plants with these chimeric genes, and the plants and cells that
are transfected, as well as seeds of such transfected plants. It is a further object to characterize sense
and antisense nucleotides capable of regulating expression of genes encoding enzymes involved
in the biosynthesis of alkaloids.

SUMMARY OF THE INVENTION 
Proteins involved in the biosynthesis of nicotine in tobacco (N. tabacum) are the subject of this
invention. More specifically, the invention concerns four variants of putrescine N-methyltransferase
(PMT1, PMT2, PMT3, and PMT4), two variants of arginine decarboxylase (ADC1 and ADC2),
ornithine decarboxylase (ODC), S-adenosylmethionine synthetase (SAMS), NADH dehydrogenase,
and phosphoribosylanthranilate isomerase.

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1. Genomic DNA gel blot analysis of the PMT gene family in N. tabacum cv. Xanthi.
Total genomic DNA (30 μg) was digested with KpnI, EcoRI, or EcoRI and KpnI, separated by
agarose gel electrophoresis, and transferred to nylon membranes. The membrane was hybridized
with a 32P-labeled antisense strand probe covering the complete coding region of the NtPMT1a
cDNA. Identity of the hybridizing bands as determined by comparison to phage DNA digests is
indicated. Molecular weights are given in kb. Note that KpnI shifts only the NtPMT1b band in the
gel blot because this restriction site is present only in Exon 1 of NtPMT1b and not NtPMT1a.

Figure 2. Amino acid sequence alignment of N. tabacum PMTs. Shown is a PILEUP alignment of
the predicted amino acid sequences of the various tobacco PMTs. Amino acid residues that differ
among the PMTs are shaded. NtPMT1a, NtPMT2, NtPMT3, and NtPMT4 refer to the deduced
amino acid sequences of the PMTs encoded by the NtPMT1a, NtPMT2, NtPMT3, and NtPMT4
genes, respectively, isolated from N. tabacum cv. Xanthi genomic DNA; cNtPMT1a is the predicted
amino acid sequence of the A411 cDNA (Accession No. D28506) isolated from N. tabacum cv.
Burley 21 by Hibi et al. (1994). The location of the exon-intron boundaries is indicated by the dark
vertical line. The nucleotide sequences for NtPMT1a, NtPMT2, NtPMT3, and NtPMT4 appear in
GenBank under the accession numbers AF126810, AF126809, AF126811, and AF126812,
respectively.

Figure 3. Polyacrylamide gel electrophoresis analysis of PCR amplified genomic DNA fragments
encoding Exon 1 of PMT from various species of Nicotiana. PCR amplification was carried out as
described in the Materials and Methods using Exon 1-specific primers 1 and 2 and total genomic
DNA isolated from N. tabacum, N. otophora, and N. tomentosiformis. The amplification products
were separated by electrophoresis on 6.5% polyacrylamide gels, and the gels were fixed and subjected to
autoradiography. The amplification products isolated from N. tabacum cv. Burley 21 and N. tabacum
cv. Xanthi were identical and only the amplification products from the reactions with N. tabacum cv.
Burley 21 DNA are shown. Standards were generated in identical reaction conditions primed with
plasmid DNA encoding the various PMT genes isolated in this study.

Figure 4. Nucleotide sequence alignment of the 5'-flanking regions of the N. tabacum PMT genes.
Shown is a PILEUP alignment of the nucleotide sequences upstream of the initiating methionine
(MET) codon of the four PMT genes isolated from N. tabacum cv. Xanthi. The proposed start site
for transcription of the NtPMT1a gene is indicated by the +1 above the sequences. The TATA-box
and CCAAT-box motifs are boxed. Potential transcriptional regulatory elements identified by
MOTIF search programs are also boxed and indicated by the following abbreviations: PAL,
palindromic sequences; G-Box, G-Box homologous sequences; MRE, metal-responsive
element homolog. Nucleotides identical in three or more sequences are shaded. The polyguanine-rich
region is underlined. Numbering is indicated to the right and is relative to the proposed start site
of each gene.

Figure 5. RNA gel blot analysis of PMT transcript levels in various tissues. Total RNA was isolated
from various tissues of mature N. tabacum cv. Burley 21 and analyzed by gel blot analysis using a
32P-labeled NtPMT1a cDNA coding region (Exons 2 to 8) probe capable of detecting all PMT
transcripts.

A. PMT transcript levels in various tobacco plant tissues and/or organs.

B. Induction of PMT expression in tobacco roots following topping. Abbreviations: HP, wild-type
(Nic1Nic2) Burley 21; LP, low alkaloid (nic1nic2) mutant. The β-subunit of mitochondrial
ATPase (β-ATPase) served as a control.

Figure 6. Semi-quantitative RT-PCR analysis of PMT gene expression in roots of tobacco plants
before and after topping.

A. Shown is the relative abundance of the individual PMT gene transcripts before and after topping.
RT-PCR was carried out as described in the Materials and Methods using Exon 1-specific primers.
Messenger RNA was amplified from total RNA isolated from the roots of wild-type (HP,
Nic1Nic2) Burley 21 and low alkaloid (LP, nic1nic2) Burley 21 tobacco plants. Far right lane
represents size standards for the genes isolated by PCR amplification from plasmid DNA. The
β-subunit of mitochondrial ATPase (β-ATPase) served as a control.

B. Bar graphs showing relative expression of the individual PMT genes following topping in both HP
and LP tobacco roots. Abbreviations: HP, wild-type (Nic1Nic2) Burley 21; LP, low alkaloid
(nic1nic2) mutant.

Figure 7. The nucleotide and predicted amino acid sequences of the transcribed portions of the N.
tabacum cv. Xanthi NtADC1 and NtADC2 genes. Shown are the complete nucleotide and predicted
amino acid sequences of the N. tabacum cv. Xanthi NtADC1 gene and where it differs from the
NtADC2 gene sequence. The dots indicate nucleotide sequence identity and the stars indicate amino
acid sequence identity. The proposed polyadenylation signal is underlined. The sequences terminate
at the point of polyadenylation found in the full length cDNA (Wang, 1999; AF127239). The
complete nucleotide sequences for the N. tabacum cv. Xanthi NtADC1 (AF127240) and NtADC2
(AF127241) including the 5' and 3' flanking sequences appear in GenBank.

Fig. 8. Comparison of the predicted amino acid sequences of arginine decarboxylases (ADCs) from
various species. Shown is a PILEUP alignment of the predicted amino acid sequence of the N.
tabacum cv. Xanthi NtADC1 gene (AF127240) aligned to the predicted ADC protein sequences from
N. sylvestris (AB12873), Arabidopsis thaliana (AF009647), Avena sativa (oat) (X56802),
Lycopersicon esculentum (tomato) (L16582) and Escherichia coli (M31770). Amino acid residues
conserved among the various ADCs are shaded.

Fig. 9. Gel blot analysis of ADC transcript levels in the roots of wild-type and low alkaloid mutant
Burley 21 tobacco before and after topping. Total RNA was isolated from the roots of mature
wild-type and low alkaloid mutant N. tabacum cv. Burley 21 and analyzed by gel blot analysis using
[α-32P]-dCTP labeled probes recognizing the coding region of ADC or the β-subunit of tobacco
mitochondrial ATP synthase (Boutry and Chua, 1985). Quantitation was carried out by
phosphorimaging using a Molecular Dynamics PhosphorImager. Values were normalized relative to
the intensities of the atp2 control band in each lane. The experiment was conducted twice with
different total RNA samples.



WO 00/67558    PCT/US00/12450



Fig. 10. Differential expression of NtADC-1 and NtADC-2 in various tissues of tobacco. Expression
of the NtADC-1 and NtADC-2 genes was analyzed using semi-quantitative RT-PCR and
gene-specific primers capable of discriminating between transcripts arising from the two genes. Panel A
shows control reactions demonstrating primer specificity in the PCR reactions using plasmids
containing the NtADC-1 and NtADC-2 coding sequences. The numbers above the lanes refer to the
specific primer combinations as described in the Materials and Methods. Panel B shows the results of
RT-PCR reactions using first strand cDNA synthesized from total RNA extracted from either root,
leaf, or flowers. As an internal control, primers specific for the atp2 gene transcript were included in the
amplification reactions. All reactions were carried out within the linear range of template
amplification as determined by varying template amount, amplification time, and temperature as
described in Riechers and Timko (1999).

Fig. 11. Genomic DNA gel blot analysis of the ODC gene family in N. tabacum. Total genomic
DNA (30 μg) was digested with EcoRI or HindIII, fractionated by agarose gel electrophoresis,
transferred to nylon membranes and hybridized with an α-32P-dCTP labeled probe encoding
full-length ODC cDNA as described in the Materials and Methods. The mobilities of molecular weight
standards are given to the right of the figure in kilobases (kb).

Fig. 12. Comparison of the nucleotide and predicted amino acid sequences of the NtODC-1 and
NtODC-2 genes. Shown are the nucleotide and predicted amino acid sequences of the NtODC-1
(AF233850) and NtODC-2 (AF233849) genes. In the figure, the complete amino acid sequence of
the pODC2 is given and the pODC1 sequence is given only where it differs. The start site of
transcription is designated as +1 and the poly(A) addition site is indicated by the arrow. Within the
relevant regions of homology, nucleotide differences between the NtODC-1 and NtODC-2 genes are
in bold lettering. The proposed TATA-box and polyadenylation signal are shaded.

Fig. 13. Protein sequence alignment of ornithine decarboxylases (ODCs) from various species.
Shown is a PILEUP alignment of the predicted amino acid sequences of the N. tabacum cv. Xanthi
pODC2 protein (AF233849) with the ODCs from N. tabacum cv. SC58 (Y10472) and cv. BY-2
(AB031066), Lycopersicon esculentum (tomato) (AF030292), Datura stramonium (jimsonweed)
(X87847), Saccharomyces cerevisiae (NP_012737), and human (Homo sapiens; AAA59966).
Amino acid residues conserved among the various ODCs are shaded.

Fig. 14. Gel blot analysis of ODC transcript levels in various tissues of mature tobacco plants and
in the roots before and after topping. Total RNA was isolated from various tissues of mature N.
tabacum cv. Burley 21 and analyzed by gel blot analysis using an α-32P-dCTP labeled coding region
probe for ODC. (A) Transcript levels in various organs of wild-type tobacco: R, root; S, stem; L,
leaf; SE, sepal; PE, petal; O, ovary; S, stamen; and AN, anther. (B) Transcript levels in roots of
Burley 21 tobacco plants before and after topping. RNA gel blot analysis of the tissue-specific
distribution and post-topping expression of transcripts encoding ODC in tobacco. As a control, the
blots were also probed with radioactively labeled probes encoding the alkaloid biosynthesis enzyme
putrescine N-methyltransferase (PMT) and a root-specific β-glucosidase (TBG-1).
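
The normalization recited for Fig. 9, in which each band intensity is expressed relative to the atp2
control band in the same lane, can be sketched as follows. This is a minimal illustration under stated
assumptions: the function names and all numeric values are hypothetical, and averaging the two
independent experiments is an assumption about how replicate blots might be summarized, not a
statement of the procedure actually used.

```python
def normalize_to_control(target, control):
    """Divide each lane's target band intensity by the control band in that lane."""
    if target.keys() != control.keys():
        raise ValueError("target and control must cover the same lanes")
    return {lane: target[lane] / control[lane] for lane in target}


def mean_of_replicates(replicates):
    """Average normalized values lane by lane across independent experiments."""
    lanes = replicates[0].keys()
    return {lane: sum(r[lane] for r in replicates) / len(replicates) for lane in lanes}


# Hypothetical phosphorimager counts for two experiments (illustration only).
adc_run1 = {"HP before": 1200.0, "HP after": 2400.0}
atp2_run1 = {"HP before": 600.0, "HP after": 800.0}
adc_run2 = {"HP before": 1000.0, "HP after": 2000.0}
atp2_run2 = {"HP before": 1000.0, "HP after": 500.0}

run1 = normalize_to_control(adc_run1, atp2_run1)
run2 = normalize_to_control(adc_run2, atp2_run2)
summary = mean_of_replicates([run1, run2])

if __name__ == "__main__":
    for lane, value in summary.items():
        print(f"{lane}: {value:.2f} (ADC signal relative to atp2)")
```

Dividing by the control band within each lane corrects for unequal RNA loading before lanes are
compared with one another.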

DETAILED DESCRIPTION OF THE INVENTION 
Nucleic acid sequences have been isolated from tobacco that encode important enzymes in
nicotine and total alkaloid formation, including PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC,
and SAMS. Also identified are cDNA fragments encoding partial segments of NADH
dehydrogenase and phosphoribosylanthranilate isomerase. Also identified are promoter regions for
the nucleotides encoding PMT1, PMT2, PMT3, PMT4, and ADC2. All of these nucleic acids are
isolated from Nicotiana tabacum L.

"Promoter" and "promoter region" are terms used interchangeably herein to refer to a DNA
sequence that regulates expression of a selected DNA sequence operably linked to the promoter, and
which effects expression of the selected DNA sequence in cells. The term also encompasses the
5' untranslated region that may be transcribed into mRNA but is not translated.

"Protein", "polypeptide", and "peptide" are used interchangeably herein when referring to a 
gene product. 

In one aspect, the invention features isolated nucleic acid molecules encoding PMT1,
PMT2, PMT3, PMT4, ADC1, ADC2, ODC, and SAMS, a fragment of NADH dehydrogenase and a
fragment of phosphoribosylanthranilate isomerase. The disclosed molecules can be non-coding (e.g.,
probe, antisense or ribozyme molecules) or can code for a functional enzyme. In one embodiment,
the nucleic acid molecules can hybridize to the nucleic acid sequences encoding PMT1, PMT2,
PMT3, PMT4, ADC1, ADC2, ODC, SAMS, a fragment of NADH dehydrogenase, or a fragment of
phosphoribosylanthranilate isomerase or to the complements of these nucleic acid sequences. In a
preferred embodiment, the hybridization is conducted under mildly stringent or stringent conditions.

In further embodiments, the nucleic acid molecule is at least 50%, 60%, 70%, 80% and more
preferably at least 90% or 95% homologous in sequence to the nucleic acid sequences encoding
PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, a fragment of NADH dehydrogenase, or
a fragment of phosphoribosylanthranilate isomerase or to the complements of these nucleic acid
sequences. In another embodiment, the nucleic acid encodes a polypeptide that is at least 50%, 60%,
70%, 80% and more preferably at least 90% or 95% similar in sequence to the amino acid sequence
of PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, the fragment disclosed herein of
NADH dehydrogenase, or the fragment of phosphoribosylanthranilate isomerase disclosed herein.
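
The percentage thresholds above can be illustrated with a simple calculation. The text does not
define how homology or similarity is to be scored, so the following sketch rests on an assumption:
it computes plain percent identity over the columns of two sequences that have already been aligned
to equal length, with "-" marking gaps. The function names and example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of compared alignment columns in which both sequences carry the same residue.

    Columns where both sequences have a gap are skipped; a gap opposite a
    residue counts as a compared column that does not match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap and y == gap:
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold_percent: float) -> bool:
    """True when percent identity is at least the stated threshold (e.g. 90 or 95)."""
    return percent_identity(aligned_a, aligned_b) >= threshold_percent


if __name__ == "__main__":
    # Hypothetical 10-residue sequences differing at one position.
    a, b = "ATGCATGCAT", "ATGCATGCAA"
    print(percent_identity(a, b), meets_threshold(a, b, 90), meets_threshold(a, b, 95))
```

A scoring scheme that credits conservative amino acid substitutions would give higher "similarity"
values than this identity count, which is why the choice of measure is flagged as an assumption.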

In another embodiment, the invention features isolated polypeptides, preferably substantially
pure preparations, encoded by the nucleic acid sequences of the invention. Particularly preferred
are those polypeptides encoded by the nucleic acid sequences identified by (SEQ. ID. NO. 2),
(SEQ. ID. NO. 5), (SEQ. ID. NO. 8), (SEQ. ID. NO. 11), (SEQ. ID. NO. 13), (SEQ. ID. NO. 15),
(SEQ. ID. NO. 18), (SEQ. ID. NO. 21), (SEQ. ID. NO. 23), (SEQ. ID. NO. 25) or (SEQ. ID. NO. 26)
or comprising a nucleotide sequence encoding the amino acid sequence encoded by (SEQ. ID. NO. 3),
(SEQ. ID. NO. 6), (SEQ. ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16),
(SEQ. ID. NO. 19), (SEQ. ID. NO. 22) or (SEQ. ID. NO. 24). In particularly preferred
embodiments, the subject polypeptides can aid in regulating the production of alkaloids, particularly
nicotine, in plants. In one embodiment, the polypeptide is identical to or similar to the protein
represented by the amino acid sequences of (SEQ. ID. NO. 3), (SEQ. ID. NO. 6), (SEQ. ID. NO. 9),
(SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16), (SEQ. ID. NO. 19), (SEQ. ID. NO. 22)
or (SEQ. ID. NO. 24). In a preferred embodiment, the polypeptide is encoded by a nucleic acid that
hybridizes with a nucleic acid represented in.

The polypeptides of the present invention can comprise full length proteins, such as represented
by (SEQ. ID. NO. 3), (SEQ. ID. NO. 6), (SEQ. ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14),
(SEQ. ID. NO. 16), (SEQ. ID. NO. 19), (SEQ. ID. NO. 22) and (SEQ. ID. NO. 24), or can comprise
one or more fragments corresponding to one or more particular motifs/domains, or to arbitrary sizes,
e.g., at least 5, 10, 25, 50, 100, 150, or 200 amino acids in length.

Another aspect of the invention features chimeric genes comprised of a promoter for the genes
for PMT2, PMT1, PMT3, PMT4, or ADC2. Yet another aspect of the invention features chimeric
genes or chimeric molecules comprised respectively of the functional gene encoding, or the
protein, PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, NADH dehydrogenase and/or
phosphoribosylanthranilate isomerase.

The invention also concerns isolated and purified promoter regions for tobacco
beta-glucosidase and their use in chimeric genes or chimeric molecules.

Another aspect of the invention involves vectors capable of transporting another nucleic acid to
which a vector has been linked. Preferably, the vectors comprise the nucleic acid sequences of the
invention or their complements.

The invention also features transgenic plants and their seeds that include (and preferably
express) a heterologous form of PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, NADH
dehydrogenase and/or phosphoribosylanthranilate isomerase. The present invention also
encompasses transgenic plants that contain in their genome a chimeric gene construction
incorporating the nucleic acid encoding PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS,
NADH dehydrogenase and/or phosphoribosylanthranilate isomerase. Such transgenic plants and their
seeds may be useful to natively produce enhanced quantities of desirable exogenous proteins, such as
compounds useful for pharmaceutical purposes, or proteins that provide herbicide resistance.

Another feature of the invention is the use as probes of the DNA sequences disclosed herein or
oligonucleotide fragments thereof. Probes may be useful to obtain additional gene family members
or locate homologous genes in tobacco or other plant species. Copies of related genes can be
obtained from existing genomic libraries or the genomic libraries can be constructed. In one
embodiment, an isolated DNA sequence comprising about a fifteen to about a twenty-five base pair
oligonucleotide sequence identical to any consecutive about fifteen to about twenty-five base pair
sequence found in the sequences of the invention is used as a probe.
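
The probe definition above, an oligonucleotide identical to a consecutive stretch of a disclosed
sequence, can be sketched in a few lines. The sketch is illustrative only and makes two
assumptions: "about fifteen to about twenty-five" is treated as the hard limits 15 and 25, and the
template used in the example is an arbitrary placeholder, not a sequence of the invention. The
function names are hypothetical.

```python
MIN_PROBE_LEN = 15  # assumed hard lower bound for "about fifteen"
MAX_PROBE_LEN = 25  # assumed hard upper bound for "about twenty-five"


def candidate_probes(sequence: str, length: int):
    """List every consecutive subsequence of the given length within the template."""
    if not MIN_PROBE_LEN <= length <= MAX_PROBE_LEN:
        raise ValueError(f"length must be {MIN_PROBE_LEN}-{MAX_PROBE_LEN} nucleotides")
    seq = sequence.upper()
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]


def is_probe_for(oligo: str, sequence: str) -> bool:
    """True when the oligo has an allowed length and occurs verbatim in the template."""
    oligo = oligo.upper()
    return MIN_PROBE_LEN <= len(oligo) <= MAX_PROBE_LEN and oligo in sequence.upper()


if __name__ == "__main__":
    template = "ACGT" * 10  # 40-nucleotide placeholder template
    print(len(candidate_probes(template, 15)), "candidate 15-mers")
    print(is_probe_for("acgtacgtacgtacg", template))
```

Testing against the complementary strand would require an additional reverse-complement step,
which is omitted here because the passage specifies identity to the disclosed sequence.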

Another feature is use of the polypeptides of the invention in an assay, such as an assay to 
identify modulators of enzyme activity in plants. 

Other features and advantages of the invention will be apparent to those of skill in the art.
The nucleotide and amino acid sequences of the invention are disclosed herein in the Sequence
Listing, text, and the figures. Specific sequences of the invention are provided in the attached
Sequence Listing and can be understood to represent promoters, nucleic acids, and proteins
respectively relating to the following proteins: PMT2 (SEQ. ID. NOS. 1, 2, and 3); PMT1 (SEQ. ID.
NOS. 4, 5, and 6); PMT3 (SEQ. ID. NOS. 7, 8, and 9); PMT4 (SEQ. ID. NOS. 10, 11, and 12);
SAMS (SEQ. ID. NOS. 13 and 14); ODC (SEQ. ID. NOS. 15 and 16); ADC1 (SEQ. ID. NOS. 17,
18, and 19); ADC2 (SEQ. ID. NOS. 20, 21, and 22); ADC1 mRNA (SEQ. ID. NOS. 23 and 24);
NADH dehydrogenase (SEQ. ID. NO. 25); and PAI (SEQ. ID. NO. 26). If only two sequence
identifiers are provided for a protein, those sequences represent the nucleic acid and the protein
respectively. If three identifiers are provided, the identifiers represent promoter, genomic or cDNA
nucleic acid, and peptide sequences, respectively. If only one identifier is provided, it represents a
DNA fragment coding for the protein or portions of it.
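
The identifier convention just described can be captured as a small lookup table. The sketch below
is illustrative: the SEQ. ID. NO. assignments are transcribed from the preceding paragraph, while the
names SEQ_IDS and describe and the label strings are assumptions introduced for the example.

```python
# SEQ. ID. NOS. per entry, as listed in the text above.
SEQ_IDS = {
    "PMT2": (1, 2, 3),
    "PMT1": (4, 5, 6),
    "PMT3": (7, 8, 9),
    "PMT4": (10, 11, 12),
    "SAMS": (13, 14),
    "ODC": (15, 16),
    "ADC1": (17, 18, 19),
    "ADC2": (20, 21, 22),
    "ADC1 mRNA": (23, 24),
    "NADH dehydrogenase": (25,),
    "PAI": (26,),
}

# Meaning of each identifier, keyed by how many identifiers an entry has.
LABELS_BY_COUNT = {
    1: ("DNA fragment",),
    2: ("nucleic acid", "protein"),
    3: ("promoter", "nucleic acid", "protein"),
}


def describe(name: str) -> dict:
    """Map each sequence type to its SEQ. ID. NO. for the named entry."""
    ids = SEQ_IDS[name]
    return dict(zip(LABELS_BY_COUNT[len(ids)], ids))


if __name__ == "__main__":
    for entry in SEQ_IDS:
        print(entry, describe(entry))
```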

For other reference, the sequences may be found at the following records in GenBank at the
following Accession Numbers, which records are hereby incorporated in their entirety herein:
AF126810 (NtPMT1); AF126809 (NtPMT2); AF126811 (NtPMT3); AF126812 (NtPMT4);
AF176908 (NtomPMT) (Nicotiana tomentosiformis); AF76909 (NotoPMT) (Nicotiana otophora);
AF127239 (ADC); AF127240 (ADC1); AF127241 (ADC2); AF127242 (ODC); AF233849 (ODC2);
AF233850 (ODC1); and AF127243 (SAMS).

The following experimental discussion is presented to better illustrate the invention. 
I. PMT 

The present invention features the characterization of four members of the nuclear gene family
encoding PMT in tobacco (N. tabacum). The nucleic acid sequences encoding PMT and the amino
acid sequences for the proteins are reported herein and can also be found in the DDBJ, EMBL, and
GenBank Nucleotide Sequence Databases under the accession numbers for NtPMT1a, NtPMT2,
NtPMT3, and NtPMT4 at AF126810, AF126809, AF126811, and AF126812, respectively. The
complete coding region and immediate 5'- and 3'-flanking regions are characterized.

As the discussion below shows, all four PMT genes present in the N. tabacum genome are
expressed in the roots of wild-type plants and differentially regulated in tobacco lines expressing
either high or low total alkaloid contents.

Materials and Methods

Plant materials 

Seeds of N. sylvestris, N. otophora, and N. tomentosiformis were obtained from the USDA-ARS
national tobacco germplasm collection (Oxford, NC). N. tabacum cv. Burley 21 and N. tabacum cv.
Xanthi seeds were kindly provided by Glenn Collins, University of Kentucky. Tobacco plants used
for DNA isolation were grown in a soil:vermiculite mixture in the greenhouse under natural lighting
conditions. Plants used for RNA extraction were grown in Moltan Plus (Moltan Co., Middleton,
TN).

Gel blot analysis of genomic DNA 

Young leaves were collected from greenhouse-grown tobacco (N. tabacum cv. Xanthi) plants and
total genomic DNA was isolated from freshly harvested tissues using a modification of the CTAB
extraction method (Dellaporta et al., 1983). Approximately 30 µg of total DNA was digested with
EcoRI, KpnI, or EcoRI and KpnI and the digestion products separated by electrophoresis through a
0.75% agarose gel. Restricted and size-fractionated DNA was denatured and transferred to Nytran+
nylon membranes (Schleicher and Schuell, Keene, NH) by capillary blotting in 0.4 N NaOH
overnight. Membranes were prehybridized in 0.25 M Na2HPO4 (pH 7.4), 7% SDS, 1 mM Na2EDTA
for at least 2 hr, then hybridized overnight at 63 °C in the same buffer with 2-3 x 10^ cpm/mL of a
32P-labeled single-stranded probe (antisense DNA strand). The probe was prepared by the method of
Bednarczuk et al. (1991) using a primer (Table 1, primer 4) designed from the 3' end of the
NtPMT1a coding region (Exon 8) and the full-length coding region of the NtPMT1a cDNA as
template. The NtPMT1a cDNA was generated by RT-PCR using synthetic oligonucleotide primers
based on the N- and C-terminal sequences of the A411 cDNA reported by Hibi et al. (1994) and
RNA template isolated from N. tabacum cv. Burley 21 roots. Membranes were washed at a final
stringency of 0.1 x SSC, 0.1% SDS at 65 °C. Hybridizing bands were visualized by autoradiography
and/or imaged using a Molecular Dynamics PhosphorImager (Model 445 SI, Sunnyvale, CA).

Genomic library construction and phage isolation 

A library of N. tabacum cv. Xanthi genomic DNA fragments constructed in EMBL3 was purchased
from Clontech (Palo Alto, CA) and a total of 1.1 x 10^ recombinant phage were screened by plaque
hybridization using random-primed 32P-labeled NtPMT1a cDNA as probe (Sambrook et al., 1989).
Prehybridization, hybridization, and washing conditions were as described above. Positive
hybridizing phage were plaque purified by subsequent rounds of rescreening and DNA was prepared
from 18 independently isolated phage. The phage DNA was characterized by restriction analysis and
DNA gel blot analysis, and fragments containing the sequences encoding PMT were subcloned into
pBluescript KS vectors for further analysis.

Comparison of the hybridizing fragments present in the 18 recombinant phage to the
hybridization pattern obtained by genomic DNA blot analysis indicated that only three of the PMT
genes suspected to be present in the N. tabacum genome were recovered from the library screen. To
obtain sequences encoding NtPMT1a, a subgenomic library was constructed from N. tabacum cv.
Xanthi DNA. The library consisted of gel-purified 2.5-3.5 kb EcoRI fragments ligated into λ ZAP
II vector arms and packaged using Gigapack III Gold packaging extracts according to the
manufacturer's instructions (Stratagene, La Jolla, CA). The primary library was amplified once in E.
coli XL1-Blue MRF' strain (Stratagene) and screened as described above, except that a
random-primed 32P-labeled NtPMT1a cDNA Exon 1-specific probe was used (Table 1). Exon 1 had
previously been amplified by PCR using primers 1 and 2 (Table 1) and the NtPMT1a cDNA as
template. The recombinant phage that hybridized with the probe was isolated from the sublibrary by
two more rounds of plaque purification, and the pBluescript phagemid containing the approximately
3.1 kb EcoRI genomic fragment with the NtPMT1a gene was excised from the λ ZAP II phage
vector using the in vivo excision protocol described by Stratagene.
DNA sequence analysis

Unless otherwise noted, DNA sequencing was performed with double-stranded plasmid DNA
templates using fluorescent dye terminator technology (dRhodamine Terminator Cycle Sequencing
Ready Reaction kit) on an ABI 310 DNA sequencer (Perkin-Elmer Applied Biosystems). For
analysis of PCR products, following electrophoretic separation of amplification reaction products,
the bands of interest were excised from the polyacrylamide gels, the DNA extracted using the
Qiagen Gel Extraction Kit, and the recovered DNA used as sequencing template. Sequencing was
performed using AmpliTaq DNA polymerase and fluorescent dye terminator technology (as
described above) and primers 1 and 2 (Table 1) specific for Exon 1. Nucleotide and amino acid
sequences were analyzed and aligned using either the Clustal method and Lasergene software
(DNAStar Inc., Madison, WI) or the PILEUP and ALSCRIPT (Genetics Computer Group, Madison,
WI) sequence analysis package (Version 9.0). Transcription factor binding site homologies were
identified in promoter DNA sequences by searching the transcription factor database using the GCG
program.

RNA gel blot analysis

For RNA analysis, roots and other tissues were harvested from mature wild-type (HP; Nic1 Nic2) and
low alkaloid mutant (LP; nic1 nic2) Burley 21 tobacco plants. For topping experiments, the stem was
cut and the top one-third of the plant was removed just prior to flower opening. Roots were
harvested just prior to topping (0 hr control) and at various times after decapitation. The tissue was
immediately frozen in liquid nitrogen and stored at -80 °C until RNA extraction and isolation.

Total RNA was isolated from vegetative organs and floral structures of HP and LP Burley 21
tobacco using the TRI-reagent (Molecular Research Center Inc., Cincinnati, OH) and quantified
spectrophotometrically by measuring A260. Total RNA (5 µg) was electrophoresed through 1.2%
agarose gels (containing 0.4 M formaldehyde) and transferred to Nytran+ nylon membranes.
Following prehybridization, the membranes were hybridized with a single-stranded NtPMT1a cDNA
antisense probe (corresponding to the antisense strand of Exons 2 to 8 of the NtPMT1a cDNA coding
region) as described above. As a control to quantify and normalize RNA levels in each lane, the blot
was hybridized with a 400-bp probe derived from the β-ATPase cDNA using primers 6 and 7 (Table
1) as described below.




Semi-quantitative RT-PCR analysis of individual PMT transcript levels 

Total RNA (1 µg) extracted from the roots of HP and LP Burley 21 tobacco plants was
reverse-transcribed into first-strand cDNA at 42 °C using SuperScript II reverse transcriptase (Gibco BRL)
according to the manufacturer's protocol. Two gene-specific primers were employed in the reactions:
primer 5 capable of recognizing Exon 3 of the PMT genes and primer 8 specific for Exon 8 of the
nuclear gene encoding the β-subunit of mitochondrial ATPase from N. plumbaginifolia (NpATP2.1)
and N. sylvestris (NsATP2.1) (Boutry and Chua, 1985; Lalanne et al., 1998). The β-ATPase
transcript served as an internal reference (constitutively expressed control) to determine loading
accuracy and to normalize expression levels (Kinoshita et al., 1992). Following first strand cDNA
synthesis, two sets of nested primers (0.4 µM each primer) were used to amplify the PMT and
β-ATPase transcripts: primers 1 and 2 (Table 1) recognized Exon 1 in all five PMT transcripts and
gave products ranging in size from 220 bp to 420 bp, and primers 6 and 7 amplified an approximately
400-bp region encompassing a portion of Exons 6 to 8 of the β-ATPase coding region.
Amplification was carried out for 25 cycles using the following reaction conditions: denaturation at
95 °C for 1 min, primer annealing at 60 °C for 35 sec, and extension at 72 °C for 1.5 min; a final
extension was conducted at 72 °C for 6 min. Amplification products were radioactively labeled by
spiking the PCR reaction with 10 µCi 32P-dCTP. Aliquots of the PCR reaction were analyzed on a
6.5% non-denaturing polyacrylamide/1X TBE gel and electrophoresed at 600 volts. The reaction
conditions were optimized to provide amplification of both PMT and β-ATPase transcripts in the
linear range of the reaction by varying the levels of first strand cDNA template, annealing
temperature, and number of cycles of amplification as described in Kinoshita et al. (1992).
Molecular weight standards were prepared by PCR amplification using the same primers and
protocol described above and plasmid DNA templates containing the PMT encoding genomic
fragments, as well as genomic DNA from the various Nicotiana species indicated in the text.

Following electrophoresis, the polyacrylamide gels were fixed in 5% MeOH, 7.5% acetic acid
for 30 min, dried overnight, and used to expose X-ray film. PMT band intensities were quantified
using phosphorimager analysis (Molecular Dynamics) and normalized relative to the intensities of
the β-ATPase control band in each lane. The experiment was conducted twice with different total
RNA samples, and representative results are presented from one of the two experiments.
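
The per-lane normalization described above amounts to dividing each PMT band intensity by the intensity of the control band in the same lane. A minimal Python sketch follows; the function name `normalize_band_intensities` and the numeric values are hypothetical and are not data from these experiments.

```python
def normalize_band_intensities(pmt_bands, control_band):
    """Divide each PMT band intensity by the control band intensity of the same lane."""
    if control_band <= 0:
        raise ValueError("control band intensity must be positive")
    return {gene: intensity / control_band for gene, intensity in pmt_bands.items()}


# Hypothetical phosphorimager counts for a single lane (not data from this study).
lane = {"NtPMT1a": 1200.0, "NtPMT2": 3000.0}
normalized = normalize_band_intensities(lane, control_band=2000.0)
```

Because every band in a lane is divided by the same control value, differences in RNA loading between lanes cancel out of the comparison.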

Results 

PMT gene structure and organization in N. tabacum 
Gel blot analysis of total genomic DNA isolated from N. tabacum cv. Xanthi, hybridized with a
radioactively labeled cDNA (NtPMT1a) encoding the complete coding region of putrescine
N-methyltransferase (PMT), showed the presence of five major hybridizing bands in KpnI or EcoRI
digested DNA, consistent with the presence of a small multigene family in the N. tabacum genome
(Figure 1).

As part of our initial characterization of the gene family encoding PMT in N. tabacum, an
EMBL3 genomic library, prepared from N. tabacum cv. Xanthi DNA, was screened using the
NtPMT1a (A411 homologous) cDNA as probe. From a total of 18 recombinant phage isolated, three
phage were recovered that contained genomic fragments encoding the NtPMT2, NtPMT3 and
NtPMT4 genes. Each of the three PMT genes was completely encoded within a uniquely sized EcoRI
fragment of the phage DNA insert, which allowed each gene to be correlated with a hybridizing
restriction fragment on the gel blot of N. tabacum genomic DNA (Figure 1). The complete coding
region and immediate 5' and 3' non-coding sequences of the three genes were determined and found
to encode full-length PMT proteins (Figure 2). Each PMT gene consisted of 8 exons and 7 introns,
consistent with the gene structure reported previously for the PMT genes from N. sylvestris
(Hashimoto et al., 1998a). Comparison of the deduced amino acid sequences (Figure 2) revealed
that the encoded PMT proteins were extremely similar over their entire length, with the only
significant variability in primary sequence localized to the extreme N-terminal region of the protein.
This region, completely encoded within Exon 1, contains a variable number of copies of an 11 amino
acid repeat with a consensus sequence of NGHQNGTSEHQ. The function of the repeated sequence is
unknown, but it is apparently inconsequential to enzyme function, since the number of repeats does not
influence activity and PMTs characterized from other species do not contain the repeated element
(Hashimoto et al., 1998a; Suzuki et al., 1999a).
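
Counting copies of the 11 amino acid repeat in a deduced N-terminal sequence can be sketched as a simple window scan. The Python example below is an illustration under stated assumptions, not the method used in these studies: the function name `count_repeats`, the tolerance of `min_matches=9` identical positions out of 11, and the example sequence are all hypothetical.

```python
CONSENSUS = "NGHQNGTSEHQ"  # 11 amino acid consensus repeat given in the text


def count_repeats(protein, consensus=CONSENSUS, min_matches=9):
    """Count non-overlapping windows matching the consensus at >= min_matches positions."""
    size = len(consensus)
    count = 0
    i = 0
    while i + size <= len(protein):
        window = protein[i:i + size]
        matches = sum(a == b for a, b in zip(window, consensus))
        if matches >= min_matches:
            count += 1
            i += size  # skip past this repeat
        else:
            i += 1  # slide one residue and try again
    return count


# Hypothetical N-terminal region: made-up flanking residues around four exact repeats.
example = "MEVIS" + CONSENSUS * 4 + "LFKPG"
```

Allowing a small number of mismatches reflects that the motif is a consensus; the threshold chosen here is arbitrary and would need to be tuned against real sequences.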

Multiple rounds of screening of the EMBL3 genomic library failed to yield additional
hybridizing phage containing sequences encoding the other two PMT genes thought to be present in
the N. tabacum genome and, therefore, a directed cloning approach was pursued using a subgenomic
library constructed from EcoRI fragments isolated from N. tabacum cv. Xanthi. From this
hybridization screening, a phage containing the approximately 3.1 kb EcoRI fragment encoding
NtPMT1a was recovered. The coding region of the NtPMT1a gene was found to be identical to the
A411 cDNA (Hibi et al., 1994), with the exception of a single base change in Exon 6 that results in a
conservative amino acid substitution. This difference could be the result of minor differences among
cultivars used in the two studies (i.e., Xanthi vs. Burley 21). Translation of the open reading frame
contained in NtPMT1a showed that it encoded a protein containing four N-terminal 11 amino acid
repeats, similar to Exon 1 of the PMT gene present in N. tomentosiformis (Hashimoto et al., 1998a).
Given the observation that NtPMT1a encoded a homolog of the PMT gene present in N.
tomentosiformis, the nature and possible evolutionary origin of the remaining PMT gene present in
the N. tabacum genome was brought into question. From our expression studies (described in detail
below), we had determined that five distinct PMT encoding transcripts were present in the roots of N.
tabacum, four of which could be accounted for based upon the length of the Exon 1 coding region in
the four PMT genes isolated and characterized in our studies described above. The fifth transcript
was similar in size to that encoded by NtPMT1a and, therefore, was designated NtPMT1b. Since the
variability in PMT gene structure is primarily localized within Exon 1, we used a PCR-based strategy
to analyze the PMT gene structure and family size in N. otophora, the other possible progenitor of N.
tabacum. As shown in Figure 3, five distinct PCR products were detected in the electrophoretic
pattern of amplification products generated from N. tabacum genomic DNA using Exon 1 specific
primers (Table 1). Consistent with our studies described above and the previous work of Hashimoto
et al. (1998a), three PCR products were detected in the electrophoretic pattern of amplification
products generated from N. sylvestris genomic DNA, and a single band was recovered from N.
tomentosiformis genomic DNA. Amplification of genomic DNA from N. otophora using Exon 1
specific primers also yielded only a single band, whose electrophoretic mobility was most similar to
that of the NtPMT1b derived product.

Analysis of PMT gene intron and flanking sequences 

The location of the seven introns within the protein coding region of the five PMT genes in N.
tabacum is identical and appears to be conserved among PMT genes from different Nicotiana
species. There is also little variation in the nucleotide sequences that comprise the Exon-Intron
splice junctions in the various PMT genes in N. tabacum (Table 2). The high degree of nucleotide
sequence similarity recognized among PMT genes within their coding regions is also present within
their introns and immediate 5' and 3' flanking sequences (Table 2 and Figure 4). In general, a greater
level of sequence identity is found in the introns of the NtPMT1, NtPMT3, and NtPMT4 genes, than
in pair-wise comparisons among the introns of the other members of the N. tabacum PMT gene
family. The observed conservation in the intron sequences of the NtPMT1, NtPMT3, and NtPMT4
genes is consistent with their origin from the same progenitor species (N. sylvestris). One interesting
exception occurs within Intron 6, where the length of the intron and the sequence similarity is more
conserved between NtPMT1a and NtPMT4, than between NtPMT4 and NtPMT2 or NtPMT3.

Approximately 1 kb of nucleotide sequence was determined 5' to the coding regions of the
NtPMT1a, NtPMT2, NtPMT3, and NtPMT4 genes (Figure 4). By comparison to the 5'-untranslated
region (UTR) present in the A411 cDNA, we set the start site for transcription initiation at
approximately 57 nucleotides upstream of the MET start codon in NtPMT1a and NtPMT3, and either
69 or 60 nucleotides upstream in NtPMT2 and NtPMT4. The major distinguishing feature between
the 5'-UTRs in the various genes is the presence or absence of a 17 bp sequence in the gene. An
appropriately placed TATA-box can be easily recognized 45 bp 5' to the initiation site in all four
genes. Within the first 200-250 bp upstream of the TATA box, a high level of sequence conservation
is found to exist among the promoter regions in the four genes. After this point, a clear difference
can be observed between the NtPMT1a promoter and the remaining three genes, and by 400 bp
upstream, little similarity can be found among any of the gene family members.

Analyzing the proximal regions of the various PMT promoters with various motif scanning
software identified several G-box-like sequences (Foster et al., 1994; Kim et al., 1992; Menkens et
al., 1995; Staiger et al., 1989; Williams et al., 1992) at various positions among the PMT promoters,
and a potential metal response element (MRE) (positions -75 to -66; numbering relative to the
NtPMT1a promoter sequence) in three of the four PMTs (Cizewski-Culotta and Hamer, 1989; Thiele,
1992). An unusual 17 nucleotide stretch of guanine occurs at positions -259 to -243 in the NtPMT1a
gene promoter followed upstream by a purine-rich region (positions -332 to -263). In the NtPMT3
promoter a 14 bp palindromic sequence (positions -497 to -484) was detected. PMT gene expression
has been reported to increase in root tissues following treatment with methyl jasmonate (Imanishi et
al., 1998). However, none of the sequence motifs reported to confer methyl jasmonate
responsiveness in other plant genes (Mason et al., 1993; Rouster et al., 1997) were detected in the
PMT promoters.

Comparison of the available nucleotide sequence information from the 3'-flanking regions of
the various PMT genes in N. tabacum revealed that the 3'-UTRs in the NtPMT2, NtPMT3, and
NtPMT4 genes of N. tabacum share approximately 81-94% identity with each other and are
essentially identical to those reported for N. sylvestris PMTs by Hashimoto et al. (1998a). The major
distinguishing feature among the various genes is the presence of two short (20 bp and 4 bp)
deletions in the NtPMT1 gene, which lowers the percent identity. The 3'-UTR of NtPMT1a is
identical to that reported for the A411 cDNA (Hibi et al., 1994) and 81-94% identical to the other
PMT genes in the N. tabacum genome. Unfortunately, no sequence information is currently
available for the 3'-UTR of the N. otophora or N. tomentosiformis PMT genes.

Regulation of PMT gene expression 



To determine whether the members of the PMT gene family in N. tabacum were differentially
expressed, a series of experiments was carried out to define the temporal and spatial distribution of
transcripts arising from the five genes. Shown in Figure 5A are the results of gel blot analysis of
total RNA extracted from various tissues of mature Burley 21 tobacco plants hybridized with a
radioactively labeled probe capable of detecting all five PMT transcripts. Consistent with previous
studies (Hashimoto et al., 1998b; Hibi et al., 1994), PMT expression is localized exclusively to roots.
When maturing wild-type (HP) Burley 21 plants are topped (i.e., the floral meristem and upper 1/3 of
the stem are removed), a dramatic increase in PMT transcript abundance is observed within 2 hr,
reaching a maximal level of accumulation by 12-24 hr. Transcripts of two sizes are detected on the gel
blots, reflecting the small difference in message size that occurs as a result of the difference in size of
Exon 1 among the genes.

In addition to examining PMT gene expression in wild-type plants, we also examined
expression in a low nicotine-producing (LP) mutant of Burley 21 (Legg and Collins, 1971). The low
nicotine Burley 21 line harbors mutations at two independent loci (nic1 and nic2) thought to be
global regulators of gene expression involved in alkaloid formation. As shown in Figure 6B, topping
of the low nicotine mutant (nic1 nic2) Burley 21 did not cause an increase in PMT transcript
abundance as observed in wild-type plants. Thus, it appears that Nic1 and Nic2 are likely involved in
regulation of PMT expression at the very least, and may also be involved in the regulation of other
genes in the alkaloid biosynthetic pathway. Whether this is a direct effect (e.g., transcriptional
activation) or indirect remains to be determined.

In order to determine the extent to which the individual members of the gene family
contributed to the general pattern of expression described above, a semi-quantitative RT-PCR
strategy (Kinoshita et al., 1992) was used to detect and quantify the levels of the individual PMT
transcripts in the roots of both wild-type (HP) and low alkaloid (LP) Burley 21 tobacco. This
approach takes advantage of the fact that Exon 1 is variable in length among the various PMT genes
(Figure 2), allowing for their individual detection and quantitation following polyacrylamide gel
electrophoresis and autoradiography.

Five RT-PCR products (representing Exon 1 from each of the five genes present in N. tabacum)
were detected in the electrophoretic profiles of amplification products derived from reactions using
either HP or LP Burley 21 root RNA (Figure 6A). All five PMT genes present in the N. tabacum
genome were expressed in the roots of wild-type plants, and topping resulted in a differential
accumulation of transcripts derived from each gene. Among the five genes, transcripts derived from
NtPMT1 and NtPMT1b showed the greatest increase in abundance, rising approximately 3-fold
during the first 24 hr post-topping, whereas levels of the NtPMT1a and NtPMT4 transcripts changed
little in response to topping (Figure 6B). In the LP mutant, little or no effect was observed on the
levels of the various PMT transcripts following topping, although in some cases (e.g., NtPMT1a) a
small but likely insignificant depression in transcript abundance was detected. Thus, it appears that
all five genes contribute to PMT activity levels within the root.
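
A fold increase such as the approximately 3-fold rise noted above is the ratio of a normalized transcript level at a given time to the level at the 0 hr control. The Python sketch below illustrates the arithmetic; the function name `fold_change` and the values are hypothetical and are not measurements from these experiments.

```python
def fold_change(time_course, baseline_hr=0):
    """Express each normalized transcript level relative to the baseline time point."""
    baseline = time_course[baseline_hr]
    if baseline <= 0:
        raise ValueError("baseline level must be positive")
    return {hr: level / baseline for hr, level in time_course.items()}


# Hypothetical normalized levels (hours after topping -> level); not measured data.
levels = {0: 0.5, 2: 0.75, 24: 1.5}
changes = fold_change(levels)
```

With these made-up values the 24 hr sample is 3-fold above the 0 hr control.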

II. ADC

The present invention features the characterization of two members of the nuclear gene family
encoding ADC in tobacco (N. tabacum L.). As the following discussion shows, ADC2 is preferentially
expressed in roots and accounts for the major portion of ADC transcripts present. Furthermore,
analysis of ADC transcript levels in roots of low and high nicotine producing lines showed that ADC
expression is under the control of the Nic1 Nic2 regulatory loci.

Materials and methods 

Plant growth and tissue preparation 

Seeds of N. tabacum cv. Xanthi, wild-type and low alkaloid nic1 nic2 mutant N. tabacum cv. Burley
21 were obtained from Dr. G. Collins (University of Kentucky, Lexington). Tobacco plants used for
DNA isolation were grown in a soil:vermiculite mixture in the greenhouse under natural lighting
conditions. Plants used for RNA extraction were grown either in Moltan Plus (Moltan Co.,
Middleton, TN) or hydroponically in a dilute (half-strength) Peters nutrient solution with continuous
aeration of the roots under natural lighting conditions in the greenhouse. Topping experiments were
conducted by removing the floral meristem, leaves and stem (approximately the upper 1/3 of the
plant) from tobacco plants just prior to blooming. Plant tissues were collected from fully matured
individuals, frozen in liquid nitrogen, and stored at -80 °C until used for RNA preparation (see
below).

Screening of genomic libraries and phage characterization 

A genomic library constructed in λ EMBL3 from N. tabacum cv. Xanthi leaf DNA (Clontech, Inc.,
Palo Alto, CA) was screened by plaque hybridization (Sambrook et al., 1989) using an
[α-32P]-dCTP-labeled, 2.7 kb EcoRI-XhoI fragment from plasmid PR24 as probe. PR24 encodes a full length
ADC cDNA isolated from the roots of wild-type N. tabacum cv. Burley 21 (Wang, 1999).
Hybridization was performed at 65 °C for 16 h in a solution containing 0.25 M Na2HPO4 (pH 7.2)
and 7% (w/v) SDS. Following hybridization, the membranes were washed twice in 2 x SSC, 0.1%
SDS for 15 min at room temperature, and once in 0.2 x SSC, 0.1% SDS for 30 min at 65 °C. Hybridizing
phage were picked and plaque purified through three subsequent rounds of hybridization screening.
Phage DNA was isolated from plaque purified phage using a Qiagen Phage Midi Preparation Kit
(Qiagen, Germany) and insert DNA characterized by restriction mapping and DNA gel blot analysis.
The relevant hybridizing bands in each phage were cloned into pBluescript SK+ vectors for further
analysis.

Nucleic acid sequencing and analysis 

Nucleotide sequencing was carried out manually using the Sequenase Version 2.0 protocols
according to the manufacturer's protocol (United States Biochemical, Cleveland, OH) or with an ABI
310 Genetic Analyzer (PE Applied Biosystems, Foster City, CA) using double-stranded plasmid
DNA templates prepared utilizing the Qiaprep Spin Plasmid Kit (Qiagen USA, Valencia, CA). The
nucleotide and predicted amino acid sequences of the various cDNAs were analyzed using BLAST
sequence analysis programs (Altschul et al., 1990; Gish and States, 1993) and protein sequence
alignments were carried out using the PILEUP program (Genetics Computer Group Sequence
Analysis package, Version 9.0; GCG, University of Wisconsin, Madison, WI) and the various gene
sequences available in the NCBI (National Center for Biotechnology Information, Bethesda, MD)
nucleotide and protein sequence database. Manual adjustment of the sequence alignments was
carried out as necessary.

RNA isolation and gel blot analysis 

Total RNA was extracted from tobacco roots, leaves, and floral parts using Tri-Reagent
(Molecular Research Center, USA, Cincinnati, OH) according to the manufacturer's protocol. For
RNA gel blot analysis, aliquots (10 µg) of total RNA extracted from the various tissues were
fractionated by electrophoresis through a 1.2% agarose-formaldehyde gel and blotted onto Nytran
nylon membranes (Schleicher & Schuell, Keene, NH) using 10 X SSC. The transferred RNA was
UV cross-linked to the membrane using a UV Stratalinker (Stratagene, La Jolla, CA) and the
membranes were prehybridized in 7% SDS, 0.25 M Na2HPO4, pH 7.2 for 2-4 hours at 65 °C.
Hybridization was carried out in the same buffer in the presence of 32P-labeled probes for 16 hr at
65 °C. The membranes were washed under high stringency conditions and subjected to
autoradiography at -80 °C for approximately 48 h.

For gel blot analysis, [α-32P]-dCTP-labeled probes were prepared by random primed labeling
(Random Primed Labeling Kit, Boehringer Mannheim, Indianapolis, IN) using 25-50 ng of a 2.7 kb
EcoRI-XhoI fragment derived from PR24 and a 460 bp fragment amplified from the β-subunit of the
tobacco mitochondrial ATP synthase gene (atp2) (Boutry and Chua, 1985).

Semi-quantitative RT-PCR analysis of NtADC1 and NtADC2 transcript levels

Total RNA (2 µg) from roots, leaves, or floral parts was reverse transcribed at 40 °C for 1 h in a
reaction cocktail containing 200 units of SuperScript II reverse transcriptase (RNase H-, Gibco BRL,
USA), 10 units RNase inhibitor (Perkin Elmer), 200 µM dNTPs and 40 pmol of primer, in a total
volume of 20 µl. For first strand cDNA synthesis, a single primer
[5'-AGAAAAACATCACCAACT-3'] capable of hybridizing to both the ADC1 and ADC2 transcripts
was used in the reaction. As a control, a primer (5'-GCAACTGTCATCTTATCATCTTC-3')
specific for the β-subunit of the tobacco mitochondrial ATP synthase gene atp2 (Boutry and Chua,
1985) was used in the reverse transcriptase reaction.

Following reverse transcription, the single stranded cDNA products were serially diluted over a
concentration range between 1 to 50 ng RNA, and PCR amplification was carried out for 25 cycles
of 45 s at 94 °C, 1 min at 64 °C and 1 min at 72 °C in a Genemate thermocycler (ISC Bioexpress,
UT). The reaction mixture contained cDNA template, 1 x PCR buffer (Boehringer Mannheim), 100
µM dNTPs, 25 pmol of each forward and reverse primer and 1 unit Taq DNA polymerase. The PCR
reactions specific for ADC1 transcripts contained the following primers: ADC1-forward,
5'-CGTAGACGCTACTGTTTC-3', and ADC1-reverse, 5'-TGGACAACTGTGGAGGCG-3'.
Reactions specific for ADC2 transcripts contained primers ADC2-forward,
5'-TGTAGATGCTGCTGTTGTTT-3', and ADC2-reverse, 5'-TGAACAACTGCGGAGGCA-3'.
Control reactions for normalization of amplification products contained 25 pmol of primers specific
for the tobacco atp2 transcripts: atp2 forward, 5'-GTATATGGTCAAATGAATGAGCC-3', and atp2
reverse.int, 5'-GCAGTATTGTAGTGATCCTCTCC-3'. For quantitation purposes, amplification
reactions were supplemented with 1 µCi 32P-dCTP. PCR products were separated by electrophoresis
through 1.2% agarose gels, and the fractionated reaction products were transferred onto Hybond N+
membranes, dried and subjected to autoradiography at -70 °C. Quantitation was carried out by
phosphorimaging using a Molecular Dynamics PhosphorImager. Values were normalized relative to
the intensities of the atp2 control band in each lane. The experiment was conducted twice with
different total RNA samples, and representative results are presented from one of the two
experiments.
Results

These studies show the structure and expression of individual members of the ADC gene
family in tobacco. An α-32P-dCTP-labeled 2.7 kb EcoRI-XhoI fragment from PR24 encoding the
ADC coding region was used to screen a λ EMBL3 phage genomic library. From a screen of
approximately 3 x 10^ phage, seventeen hybridizing phage were recovered, of which five were fully
characterized by restriction mapping and DNA gel blot analysis. These phage fell into two groups
based on their restriction profile. The relevant hybridizing fragments from the various phage were
cloned into pBluescript and their nucleotide sequence determined.

Presented in Figure 7 are the nucleotide and predicted amino acid sequences of the NtADC-1 and
NtADC-2 genes. Both genes contain a single open reading frame, uninterrupted by introns. The
nucleotide and amino acid sequence encoded in NtADC-1 is identical to that of PR24, the full length
cDNA isolated from N. tabacum cv. Burley 21. There are 84 nucleotide differences within the
NtADC-1 and NtADC-2 coding regions, resulting in 23 amino acid differences between the ADC1
and ADC2 proteins. The ADC1 protein is one amino acid shorter in length, missing
Val-13.

By comparison to the full-length cDNA, the 5'-untranslated regions (UTRs) present in
NtADC-1 and NtADC-2 are 431 bp and 432 bp long, respectively. The size of the 5'-UTR in the
ADC transcripts is considerably larger than the average size of the plant leader sequence (Joshi,
1987). In contrast, the 3'-UTRs present in NtADC-1 and NtADC-2 are relatively short,
approximately 84 nucleotides in length. In both gene sequences, a conserved polyadenylation signal
(AATAATA) can be recognized 23 nucleotides from the polyadenylation site found in the
PR24 cDNA.
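
The spacing between a polyadenylation signal and the poly(A) site can be checked with a few lines
of code. The following is a minimal sketch in Python, not the procedure used in the specification.
The function name `signal_distance` and the toy sequence are hypothetical, and the distance is
assumed to be counted from the last nucleotide of the motif to the 3' end of the supplied sequence.

```python
def signal_distance(utr: str, motif: str = "AATAATA"):
    """Return the number of nucleotides between the end of the last
    occurrence of `motif` and the 3' end of `utr` (taken here as the
    polyadenylation site), or None if the motif is absent."""
    utr = utr.upper()
    start = utr.rfind(motif.upper())
    if start == -1:
        return None
    return len(utr) - (start + len(motif))


# Hypothetical 3'-UTR: 10 nt, the motif, then 23 nt up to the poly(A) site.
toy_utr = "GTTGTCGTAG" + "AATAATA" + "C" * 23
print(signal_distance(toy_utr))  # 23
```

With a real 3'-UTR, the same call would be made on the sequence ending at the mapped poly(A) site.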

Pairwise comparison of the N. tabacum ADC1 and ADC2 proteins with the ADCs of other
plant species showed that the N. tabacum proteins are approximately 82% identical to the ADC of
its evolutionary progenitor species N. sylvestris [GenBank Accession No. AB012873] and 86%
identical to the ADC from tomato (Lycopersicon esculentum) [31], another member of the
Solanaceae family (Figure 2). As might be expected, the N. tabacum ADC shares considerably less
similarity with ADCs isolated from more distantly related species, such as Arabidopsis (67%
identical [32, 33]), soybean (67% identical [34]), and oat (42% identical [35]), and is only 29%
identical to the enzyme from Escherichia coli [36].

The predicted protein coding regions for the N. tabacum ADCs are substantially longer than
those reported for the ADC proteins of N. sylvestris and L. esculentum [31], but are similar in length
to those reported in Arabidopsis, oat, soybean [32-35] and for the E. coli enzyme [36]. The
difference in overall length appears to arise from an apparent nucleotide deletion in the N. sylvestris
and tomato cDNA sequences relative to the ADC1 and ADC2 predicted sequences and those in other
plants. In the nucleotide sequences reported for both the N. sylvestris and tomato cDNAs, a guanine
residue (position 2295 in the N. sylvestris sequence and 1531 in the tomato sequence) is missing
[GenBank Accession No. AB012873]. This deletion changes the reading frame and introduces a
premature termination to the predicted coding region. Using the sequence information available in
the NCBI database, correcting for this error allowed us to extend the predicted C-terminus of
both ADC proteins, yielding the alignment to the N. tabacum ADCs and those of other plant ADCs
as indicated in Figure 8. We have also included in the alignment shown in Figure 8 the correction at
the N-terminus of the predicted tomato ADC protein sequence noted by Pérez-Amado et al. [37],
allowing better alignment of all of the higher plant sequences.
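
The effect of a single missing nucleotide on a predicted coding region can be illustrated with a
toy example. The sketch below is in Python and uses a hypothetical 21-nucleotide sequence, not the
N. sylvestris or tomato cDNA. It assumes the standard genetic code and translation from the first
nucleotide. Removing one guanine shifts the reading frame and exposes a premature stop codon,
and restoring it recovers the longer product.

```python
from itertools import product

# Standard genetic code, built in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate from the first nucleotide until the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Hypothetical coding sequence: ATG GCT GGA TTG AAG CTT TAA
intact = "ATGGCTGGATTGAAGCTTTAA"
# Same sequence with the guanine at index 6 missing: ATG GCT GAT TGA ...
deleted = intact[:6] + intact[7:]

print(translate(intact))   # MAGLKL
print(translate(deleted))  # MAD
```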

Developmental regulation of arginine decarboxylase expression 

It has been shown that nicotine formation can be activated in the roots of maturing tobacco
plants by topping, that is, removal of the flower head and several young leaves (Akehurst, 1981;
Hibi et al., 1994). Coincident with the activation of nicotine formation, there is an increase in the
levels of transcripts encoding ODC, PMT and spermidine synthase (SPS) over the subsequent 24 hr
period in wild-type plants (Hibi et al., 1994; Riechers and Timko, 1999). To determine the effects of
topping on ADC expression in roots, Burley 21 plants were grown in the greenhouse to the bud stage,
at which point the upper 1/3 of the plant was removed and samples of root tissue were collected
before and at various times post-topping. As shown in Figure 9, ADC message abundance increased
in the roots of topped Burley 21 plants during the 24 hr period after topping. Low alkaloid (LA)
mutants of Burley 21 show a much lower level of ADC expression in their roots, and no induction of
ADC transcript accumulation after topping. The lack of ADC induction in the low-alkaloid mutant is
consistent with previous studies (Hibi et al., 1994; Riechers and Timko, 1999; Wang, 1999) showing
a general inability to activate gene expression leading to increased polyamine formation and alkaloid
biosynthesis as a result of the mutation of the Nic1 and Nic2 regulatory genes.

NtADC-2 is predominantly expressed in roots of wild-type plants.

Due to the high degree of identity between the NtADC-1 and NtADC-2 transcripts (95.8% in the
coding regions, and 94.4% and 96.4% in the 5'- and 3'-UTRs, respectively), it is impossible to
distinguish between the two transcripts by RNA gel blot analysis. Therefore, we employed an RT-PCR
based strategy and gene-specific oligonucleotide primers. Total RNA was extracted from tobacco roots,
leaves and flowers, and single-stranded cDNA was synthesized using an oligonucleotide primer capable
of hybridizing to both ADC1 and ADC2 transcripts. As an internal control for amplification, a
gene-specific primer recognizing the atp2 transcript encoding the β-subunit of the tobacco
mitochondrial ATPase was included in the reactions. Under experimental conditions providing
amplification in the linear range of the PCR reaction, gene-specific forward and reverse primers were
used to specifically amplify either ADC1 or ADC2 cDNAs. Test reactions (Figure 10A) using plasmid
DNA encoding NtADC1 or NtADC2 as template demonstrated the specificity of the primers. As shown in
Figure 10B, the main transcripts detectable in all tissues tested are derived from NtADC-2. Flowers
express the highest level of ADC, and leaves the lowest. In the flowers, although ADC1 is
detectable, it is far less abundant than ADC2. Roots also express a significant level of ADC.
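
Gene-specific primers of this kind rely on the few positions at which two highly similar
transcripts differ. The following Python sketch lists those positions for a pair of pre-aligned
sequences of equal length. The function name `mismatch_positions` and the two short sequences are
hypothetical, and the sketch says nothing about how the primers in this study were actually chosen.

```python
def mismatch_positions(seq_a: str, seq_b: str) -> list:
    """Return 1-based positions at which two aligned sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = zip(seq_a.upper(), seq_b.upper())
    return [i + 1 for i, (a, b) in enumerate(pairs) if a != b]


# Hypothetical aligned fragments of two closely related transcripts.
fragment_1 = "ATGCCGGCCCTAGG"
fragment_2 = "ATGCCTGCCTTAGG"

positions = mismatch_positions(fragment_1, fragment_2)
identity = 100.0 * (len(fragment_1) - len(positions)) / len(fragment_1)
print(positions)             # [6, 10]
print(f"{identity:.1f}%")    # 85.7%
```

A primer whose 3' end sits on one of the listed positions is more likely to discriminate between the
two templates than one placed in a fully conserved stretch.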

ADC transcript levels are highest in the roots and floral organs, and low in other plant tissues.
The two ADC genes investigated appear to have different modes of regulation, with ADC2 being
predominantly expressed in the roots and other organs.

At the present time, only limited information is available on the nature of regulatory regions
in the promoters of genes encoding enzymes of alkaloid biosynthesis. The availability of cloned
genomic fragments encoding ADC allows one to begin mapping the regulatory sequences within
these genes that are responsible for tissue-specific, developmental, and inducible expression.



III. ODC 

The present invention features the genes of two members of the nuclear gene family encoding
ODC in tobacco, N. tabacum. As the following experimental discussion shows, the ODC-2 gene is
preferentially expressed in roots and floral tissues. Furthermore, the abundance of ODC transcripts in
root tissues is affected by topping. In addition, analysis of ODC transcript levels in roots of low
and high nicotine producing lines shows that ODC expression is under the control of the Nic1 and Nic2
regulatory loci.

Materials and methods

Plant growth and tissue preparation 

Seeds of N. tabacum cv. Xanthi and of wild-type and low alkaloid nic1 nic2 mutant N. tabacum cv.
Burley 21 were obtained from Dr. G. Collins (University of Kentucky, Lexington). Tobacco plants used
for DNA isolation were grown in a soil:vermiculite mixture in the greenhouse under natural lighting
conditions. Plants used for RNA extraction were grown either in Moltan Plus (Moltan Co.,
Middleton, TN) or hydroponically in a dilute (half-strength) Peters nutrient solution with continuous
aeration of the roots under natural lighting conditions in the greenhouse. Topping experiments were
conducted by removing the floral meristem, leaves and stem (approximately the upper 1/3 of the
plant) from tobacco plants just prior to blooming. Floral parts and other plant tissues were collected
from fully matured individuals, frozen in liquid nitrogen, and stored at -80 °C until used for RNA
preparation (see below).

Screening of genomic libraries and phage characterization 

A genomic library constructed in EMBL3 from N. tabacum cv. Xanthi leaf DNA (Clontech, Inc.,
Palo Alto, CA) was screened by plaque hybridization (Sambrook et al., 1989) using a
32P-radiolabeled, 1.6 kb EcoRI-XhoI insert from plasmid PR46 as probe. PR46 encodes a full-length
ODC cDNA previously isolated by differential screening of plasmid libraries prepared from mRNA
isolated from the roots of wild-type Burley 21 plants before and 3 days post-topping (Wang, J.,
Sheehan, M., Bookman, H., and Timko, M.P., unpublished data). Hybridization was performed at
65 °C for 16 h in a solution containing 0.25 M Na2HPO4 (pH 7.2) and 7% (w/v) SDS. Following
hybridization, the membranes were washed twice in 2 × SSC, 0.1% SDS for 15 min at room
temperature, and once in 0.2 × SSC, 0.1% SDS for 30 min at 65 °C. Hybridizing phage were picked and
plaque purified through three subsequent rounds of hybridization screening. Phage DNA was isolated
from plaque-purified phage using a Qiagen Phage Midi Preparation Kit (Qiagen USA, Valencia, CA)
and insert DNA was characterized by restriction mapping and DNA gel blot analysis. The relevant
hybridizing bands in each phage were cloned into pBluescript SK+ vectors for further analysis.

Nucleic acid sequencing and analysis 

Nucleotide sequencing was carried out manually using Sequenase Version 2.0 according to the
manufacturer's protocol (United States Biochemical, Cleveland, OH) or with an ABI
310 Genetic Analyzer (PE Applied Biosystems, Foster City, CA) using double-stranded plasmid
DNA templates prepared with the Qiaprep Spin Plasmid Kit (Qiagen USA, Valencia, CA). The
nucleotide and predicted amino acid sequences of the various cDNAs were analyzed using BLAST
sequence analysis programs (Altschul et al., 1990; Gish and States, 1993), and protein sequence
alignments were carried out using the PILEUP program (Genetics Computer Group Sequence
Analysis package, Version 9.0; GCG, University of Wisconsin, Madison, WI) and the various gene
sequences available in the NCBI (National Center for Biotechnology Information, Bethesda, MD)
nucleotide and protein sequence databases. Manual adjustment of the sequence alignments was
carried out as necessary.

RNA isolation and gel blot analysis 

Total RNA was extracted from tobacco roots, leaves, and floral parts using Tri-Reagent
(Molecular Research Center, USA, Cincinnati, OH) according to the manufacturer's protocol. For
RNA gel blot analysis, aliquots (10 µg) of total RNA extracted from the various tissues were
fractionated by electrophoresis through a 1.2% agarose-formaldehyde gel and blotted onto Nytran
nylon membranes (Schleicher & Schuell, Keene, NH) using 10 × SSC. The transferred RNA was
UV cross-linked to the membrane using a UV Stratalinker (Stratagene, La Jolla, CA) and the
membranes were prehybridized in 7% SDS, 0.25 M Na2HPO4, pH 7.2, for 2-4 hours at 65 °C.
Hybridization was carried out in the same buffer in the presence of 32P-labeled probes for 16 hr at
65 °C. The membranes were washed under high stringency conditions and subjected to
autoradiography at -80 °C for approximately 48 h.

Restriction fragments derived from cDNA clones of interest were separated by agarose gel
electrophoresis, and the DNA was purified and quantified by spectrophotometry. [32P]-dCTP-labeled
probes were prepared from 25-50 ng of insert DNA by random primed labeling (Random Primed
Labeling Kit, Boehringer Mannheim, Indianapolis, IN). As a control, the blots were also probed with
radioactively labeled probes encoding the alkaloid biosynthesis enzyme putrescine
N-methyltransferase (PMT) (Riechers and Timko, 1999), a root-specific, topping-inducible
β-glucosidase encoding cDNA (TBG-1) (Riechers, D.E. and Timko, M.P., unpublished data), and 26S
rRNA (PR31) or 28S rRNA fragments.

Genomic DNA isolation and gel blot analysis 

Tobacco genomic DNA was prepared from tobacco leaf tissue by the method of Junghans and
Metzlaff (1990). Total genomic DNA (15 µg) was digested to completion with EcoRI or HindIII, the
digestion products were fractionated by electrophoresis through a 0.8% (w/v) agarose gel, and
transferred onto Nytran nylon membrane (Schleicher & Schuell, Keene, NH) in the presence of 0.4 N
NaOH (Sambrook et al., 1989). Following transfer, the membrane was rinsed in 2 × SSC, the DNA
was UV cross-linked to the membrane, and the membrane was prehybridized and hybridized as
described above. Following hybridization and washing, the membranes were subjected to
autoradiography at -80 °C.

Results and discussion

Gel blot analysis of tobacco genomic DNA cut with various restriction enzymes and
hybridized with an [α-32P]-dCTP-labeled 1.6 kb EcoRI-XhoI cDNA fragment (PR46) encoding the
full-length ODC protein from N. tabacum cv. Burley 21 (Wang, J., Sheehan, M., Bookman, H. and
Timko, M.P., unpublished data) indicated that ODC is encoded by a small gene family in the
N. tabacum genome (Fig. 11). Four to five major bands and several minor bands of sufficient size to
encode full-length genes are detected in either EcoRI- or HindIII-digested tobacco DNA.

To further analyze the structure and regulation of members of the ODC gene family in
tobacco, a λ EMBL3 phage genomic library constructed with DNA from N. tabacum cv. Xanthi was
screened using [α-32P]-labeled probes prepared from PR46 (as described above). From a screen of
approximately 3 × 10^[exponent illegible] phage, five hybridizing phage were recovered, of which
three were fully characterized by restriction mapping and DNA gel blot analysis. Two phage proved to
contain identical insert DNA and the third had a unique restriction digestion profile. Following DNA
gel blot analysis, the hybridizing fragments were cloned into pBluescript and their nucleotide
sequences determined.

The complete NtODC-2 gene spans two SalI fragments of 2.7 kb and 6.5 kb. The coding
region of the gene contains a single 1302 bp open reading frame uninterrupted by introns (Fig. 12).
The nucleotide sequence of NtODC-2 is identical within the coding and 5'- and 3'-untranslated
regions to the PR46-encoded cDNA, with the exception of four nucleotide changes (residues +2, +4,
+6 and +8) in the 5'-untranslated region. These nucleotide differences likely reflect changes
introduced during the cDNA synthesis reaction.

The predicted amino acid sequence for the NtODC-2 encoded protein (designated pODC2)
(Fig. 13) is identical to the ODC characterized from Burley 21 tobacco encoded by PR46 (Wang, J.,
Sheehan, M., Bookman, H. and Timko, M.P., unpublished data) and to the partial N. tabacum ODC
cDNA sequence (PR17) reported by Malik et al. (1996). Comparison of the predicted amino acid
sequence for pODC2 with the ODC proteins characterized from two different tobacco cultivars
showed that pODC2 differs by 7 amino acids (98% identity) from the ODC protein characterized
from the high alkaloid cultivar, N. tabacum cv. SC58 [GenBank Accession No. Y10472.1] and by 7
amino acids (98% identity) from the ODC protein from BY-2 cells. The tobacco pODC2 is 89% and
90% identical to the ODCs from tomato (Lycopersicon esculentum) and jimsonweed (Datura
stramonium), respectively, but substantially less similar to ODCs from yeast (35% identity) and
humans (32% identity).
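
The 98% figure can be checked arithmetically from the numbers given above. The Python sketch below
assumes that the 1302 bp open reading frame includes the stop codon, giving a 433 amino acid
protein, and that the 7 differences are counted over a gap-free, full-length alignment. Both
assumptions are illustrative and are not stated in the text. The function name `percent_identity` is
hypothetical.

```python
def percent_identity(differences: int, aligned_length: int) -> float:
    """Percent identity for a gap-free alignment of the given length."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return 100.0 * (aligned_length - differences) / aligned_length


orf_bp = 1302
# Assumption: the ORF length counts the stop codon, so one codon is subtracted.
protein_len = orf_bp // 3 - 1
identity = percent_identity(7, protein_len)
print(f"{protein_len} aa, {identity:.1f}% identity")  # 433 aa, 98.4% identity
```

Under these assumptions, 7 differences in 433 residues give about 98.4% identity, in line with the
reported 98%.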

The NtODC-1 gene, contained on a 4.0 kb XbaI fragment, encodes a single open reading
frame of 141 amino acids encompassing the amino-terminal one-half of ODC (Fig. 12). Six amino
acid residue changes distinguish the NtODC-2 and NtODC-1 encoded proteins over the homologous
region of the proteins. Beginning at amino acid residue 130, the NtODC-1 encoded protein (pODC1)
diverges from pODC2, with a stop codon present after residue 141. Scanning the available
nucleotide sequence (> 1 kb) in the 3'-flanking region of the NtODC-1 gene failed to reveal any
evidence for ODC-homologous protein sequences in any of the three translational reading frames.
Interestingly, a comparison of the 5'-flanking sequences of the NtODC-1 and NtODC-2 genes
revealed that while the NtODC-2 gene has a clearly recognizable TATA box properly located at
approximately -35 bp from the transcriptional start site, no such regulatory motif is found in the
NtODC-1 gene sequence. Consistent with this observation, RNA gel blot analysis performed using a
hybridization probe prepared from NtODC-1 immediately downstream of the frame shift failed to
detect any message in various tissues of mature tobacco plants (data not shown). Thus, it appears that
NtODC-1 represents an unexpressed pseudogene in the N. tabacum genome.
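
Scanning a flanking region in the three forward reading frames amounts to translating the same
nucleotide sequence from offsets 0, 1 and 2. The following Python sketch shows that step only. The
function name `three_frame_translation` and the 12-nucleotide input are hypothetical, the standard
genetic code is assumed, and the comparison of each translation against known ODC proteins (for
example by a similarity search) is left out.

```python
from itertools import product

# Standard genetic code, built in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def three_frame_translation(dna: str) -> dict:
    """Translate `dna` in forward frames 1, 2 and 3 (stops shown as "*").

    Codons containing characters other than T, C, A or G are shown as "X".
    """
    dna = dna.upper()
    frames = {}
    for offset in range(3):
        codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
        frames[offset + 1] = "".join(CODON_TABLE.get(c, "X") for c in codons)
    return frames


# Hypothetical 12-nucleotide example.
toy = "ATGGCTTAAGGA"
for frame, peptide in three_frame_translation(toy).items():
    print(frame, peptide)
# 1 MA*G
# 2 WLK
# 3 GLR
```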

To determine the spatial pattern of expression of the NtODC-2 gene, gel blot analysis was
carried out using total RNA prepared from roots, stems, young and mature leaves, and various floral
parts of Burley 21 tobacco plants. As shown in Fig. 14, transcripts encoding ODC were easily
detected in the roots, with little or no expression in other tissues except sepals, carpels, and mature
stamens.

The formation of nicotine and total leaf alkaloids in tobacco is known to be under the control
of at least two independent genetic loci (Legg et al., 1969; Legg and Collins, 1971), designated Nic1
and Nic2 (Hibi et al., 1994). Nic1 and Nic2 are semidominant and operate synergistically to control
plant alkaloid content, with mutations within these genes resulting in plants with reduced levels of
nicotine and total leaf alkaloids (wild-type > nic1 > nic2 > nic1 nic2) (Legg et al., 1969; Legg and
Collins, 1971). Although no information is available on the nature of their encoded products, it has
been speculated that Nic1 and Nic2 likely encode transcriptional regulators capable of globally
interacting with a subset of genes encoding components of polyamine and alkaloid biosynthesis
(Hibi et al., 1994). Removal of the flower head and several young leaves (i.e., topping) leads to
activation of nicotine formation in the roots of decapitated plants (Akehurst, 1981; Hibi et al., 1994).
To determine the effects of topping on NtODC-2 expression in roots, Burley 21 plants were grown in
the greenhouse to the bud stage, at which point the upper 1/3 of the plant was removed and samples
of root tissue were collected before and at various times post-topping. As shown in Fig. 14B, low
levels of ODC transcripts were found in roots prior to topping, and message abundance increased
approximately 2-fold in the roots of topped Burley 21 plants 4 hr after topping. By 24 hr after
topping, ODC transcript levels returned to their initial levels. Low alkaloid mutants of Burley 21
subjected to the same treatment show a much lower level of stimulation of ODC transcript
accumulation after topping, and the enhanced transcript abundance does not persist beyond 4 hr. By
comparison, transcripts encoding PMT and a tobacco root-specific β-glucosidase (TBG-1) show
patterns of accumulation similar to that observed for ODC transcripts in wild-type plants, but no
induction in the low-alkaloid mutant, consistent with previous studies (Hibi et al., 1994; Riechers
and Timko, 1999; Wang, 1999).
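
A fold change of this kind is usually expressed after correcting each lane for loading. The Python
sketch below divides each target band intensity by the intensity of a control band in the same lane
and then scales to the pre-topping lane. All intensity values and the function name `fold_change`
are hypothetical, and the use of an rRNA band as the control is an assumption made for
illustration; no measured intensities are given in the text.

```python
def fold_change(target: list, control: list) -> list:
    """Normalize target bands to control bands, then scale to the first lane.

    Both lists are ordered by time point, with the pre-topping lane first.
    """
    if len(target) != len(control):
        raise ValueError("target and control must have one value per lane")
    normalized = [t / c for t, c in zip(target, control)]
    baseline = normalized[0]
    return [n / baseline for n in normalized]


# Hypothetical band intensities at 0, 4, 8 and 24 hr after topping.
odc_band = [100.0, 220.0, 150.0, 90.0]
rrna_band = [1000.0, 1100.0, 1000.0, 900.0]

print([round(x, 2) for x in fold_change(odc_band, rrna_band)])
# [1.0, 2.0, 1.5, 1.0]
```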

IV. SAMS 

A single recombinant phage is identified as encoding SAMS. This λ phage contains an
approximately 15 kb SalI insert. Restriction mapping and PCR analysis indicate that the insert
DNA contains primarily the coding and 3' non-coding portions of the SAMS gene. The nucleotide
sequence for the gene encoding SAMS can be found at GenBank Accession No. AF27243 (full
length SAMS cDNA).

V. NADH dehydrogenase 

A fragment of the cDNA encoding NADH dehydrogenase in N. tabacum shows high
expression in the roots of mature wild-type HP plants compared to low alkaloid mutant LP plants.

VI. Phosphoribosylanthranilate isomerase (PAI)

The gene encoding a fragment of phosphoribosylanthranilate isomerase in N. tabacum is
a homolog of the Arabidopsis thaliana gene encoding PAI, an enzyme involved in tryptophan
biosynthesis. This enzyme is involved in the overall formation of aromatic compounds in plants.

What is claimed is:

1. An isolated DNA molecule comprising the nucleotide sequence of (SEQ. ID. NO. 2), (SEQ. ID.
NO. 5), (SEQ. ID. NO. 8), (SEQ. ID. NO. 11), (SEQ. ID. NO. 13), (SEQ. ID. NO. 15), (SEQ. ID.
NO. 18), (SEQ. ID. NO. 21), (SEQ. ID. NO. 23), (SEQ. ID. NO. 25) or (SEQ. ID. NO. 26) or
comprising a nucleotide sequence encoding the amino acid sequence encoded by (SEQ. ID. NO. 3),
(SEQ. ID. NO. 6), (SEQ. ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16),
(SEQ. ID. NO. 19), (SEQ. ID. NO. 22) or (SEQ. ID. NO. 24).

2. A vector comprising the isolated DNA molecule of claim 1 operably linked to sequences capable
of directing the transcription of a mRNA encoded by said isolated DNA molecule.

3. An isolated DNA molecule comprising a DNA sequence complementary to the nucleotide 
sequence of claim 1. 

4. A vector comprising the isolated DNA molecule of claim 3 operably linked to sequences capable 
of directing the transcription of a mRNA encoded by said isolated DNA molecule. 

5. A cultured transgenic tobacco cell stably transformed with the vector of claim 2. 

6. A cultured transgenic tobacco cell stably transformed with the vector of claim 4.

7. A transgenic tobacco plant stably transformed with the vector of claim 2. 

8. A transgenic tobacco plant stably transformed with the vector of claim 4.

9. The isolated DNA molecule of claim 1, wherein the isolated DNA molecule comprises the
nucleotide sequence of (SEQ ID NO:).

10. A vector comprising the isolated DNA molecule of claim 9 operably linked to sequences capable
of directing the transcription of a mRNA encoded by said isolated DNA molecule.

11. An isolated DNA molecule comprising a DNA sequence complementary to the nucleotide
sequence of the isolated DNA molecule of claim 9.

12. An isolated DNA sequence comprising about a fifteen to about a twenty-five base pair
oligonucleotide sequence identical to any consecutive about fifteen to about twenty-five base pair
sequence found in (SEQ. ID. NO. 2), (SEQ. ID. NO. 5), (SEQ. ID. NO. 8), (SEQ. ID. NO. 11),
(SEQ. ID. NO. 13), (SEQ. ID. NO. 15), (SEQ. ID. NO. 18), (SEQ. ID. NO. 21), (SEQ. ID. NO. 23),
(SEQ. ID. NO. 25) or (SEQ. ID. NO. 26).

13. A cultured transgenic tobacco cell stably transformed with the vector of claim 10. 

14. A transgenic tobacco plant stably transformed with the vector of claim 10. 

15. A vector comprising a DNA sequence which encodes an antisense mRNA which is
complementary to a fragment of a mRNA encoded by the isolated DNA molecule of claim 1,
wherein said sequence is operably linked to sequences capable of directing the transcription of said
antisense mRNA in tobacco cells and wherein the expression of said antisense mRNA in tobacco
cells is sufficient to provide for reduced nicotine content in tobacco cells which are stably
transformed with said vector as compared to untransformed tobacco cells.

16. A cultured transgenic tobacco cell stably transformed with the vector of claim 15. 

17. An isolated and purified protein comprising the amino acid sequence identified in (SEQ. ID. NO.
3), (SEQ. ID. NO. 6), (SEQ. ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16),
(SEQ. ID. NO. 19), (SEQ. ID. NO. 22) or (SEQ. ID. NO. 24).

18. A method for regulating gene expression in a plant comprising functionally linking an alkaloid
gene promoter to a nucleic acid encoding a protein, wherein the promoter comprises a nucleic acid
sequence selected from the group consisting of the sequences identified in (SEQ. ID. NO. 1), (SEQ.
ID. NO. 4), (SEQ. ID. NO. 7), (SEQ. ID. NO. 10), (SEQ. ID. NO. 17), and (SEQ. ID. NO. 20).

19. The method of claim 18, wherein the nucleic acid encoding a protein encodes a protein involved
in the biosynthesis of alkaloids in plants.



20. A plant transformed by the method of claim 18. 
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NtPMT3 
NtPHTZ 
NtFMT4 
NtPMTla 



-gc tgtacaaaag gatgtctcaa atcatttgga atattaattc -1029 
ctgagttg -1039 



NtPMra 
NtPMTZ 
NtPMTI 
NtPHHa 

NtE>HT3 
NtPHT2 
NtPHr4 
NtPMTla 

NtPMT3 
NtPMT2 
NtPKT4 
NtPMTla 

NtPKr3 
NtPMT2 

NtPMTla 

NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 



NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 



NtPMT3 
NtPMT2 
NtPHT4 
NtPMTla 



tgcaatcaac aagaaatacc ccactattaa gacccattat cactggcaca aaaattatga gatcattaaa catcttaaac -949 
acaagaacaa ttcctggtga atcagatgga tgaagataat agaggtgggt ggaatctata accaaa ggag"clgg|t tgagt -959 

etgtccetat ttggaagagt gtggtatggg agatgcetcc caggagtacc taaagctgaa tatgatggaa gttttaacaa -869 
gactgtgcga gttgcagaaa caattgaagg gtcatttgtg gaatttgggg ccatttcaaa ggaaaaagaa aagatgactt •819 



aeaaattggg aagcagggat tgagggattc tcagagatga agagggaggc tttgtcatgg ctttttcgat gcctataatc *')89 
ageattaata aatcaaatta aaataaggct tagcgttaaa atcaaaggaa atggeaagcc tggctcctgg agcaatgctt -199 



tataataaca tcagtgaagc agaattgaaa gccatcaagt atgggtgtga atggtgcaaa tacaaaggaa tatcaaactt -109 
ctgaggacag tagtaaaaac aatatcagac aaaaagtaaa gttgtattat ttagcttgag gataaagtat gtcattagtt -119 



cattgtggaa 
ttgtgagaga 



cegagaaatt 
caagcccctc 



actgactcga ggatgatcta tgacatacta cagaccaaaa 
tttggtgtcc tctacaatga ttgttgaagt ccctatttat 



Stc||aagcaa caacaagttg aaacaagaga 
^cj|atacac aggaaacaaa atcctaggat 



-629 
•639 



. — g gstfcaatgg agaaggaaaa tatttccagt -625 



0-Bo« 

aatggagatt ctggac ^t qeaga acacc tgttacccat tgccttcgcg 
ttaaatgaca ataatggggt taatgatgaa tatgtagcgg catgacatga 



aagca^ 
atgcc^ 



jitca ag 
Uat tc 



ggcagac 
pcgcaac 



-549 
-559 



gtaaacacaa gtgaatgaag agaagccaaa ataatctcta tcattcaagc cttaggtgga gatta^aaa a^tttact -545 

PAZ. 



tggtttgcta 

gactatttat 



aagiggccac cagagctaac gaaggt^td 
ttaftattga ggaatatttt ttattaiad 



tct? 



i(tatCLggtg 



acaagcattc 



gtatcaaaag cggccaaggg 
gtttgcttcc gttgattacg 



-469 
-419 



ttcttatcaa agt^ataggt gatcaacagc tttcgt^a§ |gtcattagg agaatattat aatctctttt atgctgaaga -465 
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NtADCl 

ttcacgttctcttctcaattcccataaaagaaacccttccgtfai^ 
gtttccgtcctatttt--ctcttcttctacgcttc 78 
NtADC2 

c 

tc. . .a c 

..t... 80 



319 



NtADCl 

ctcttctgatatcaatatctgtatggtgtttttcttg 
ttcgaattttagatttgttttgcctttaatacctgta 
acctta 158 
NtADC2 

a 

t a. . 

160 

NtADCl 

taattctctgtttaaaccaaaaacttagcttcttctg 
aagtcagggtggggatatttggatcgtgtaagagtgt 
gttaga 238 
NtADC2 - 



239 



NtADCl 

aggtgattatcttttgattcagttccttttttgcttc 
ttttgagggggtagccggggcctcggcctcggcgggt 
tttaat 318 
NtADC2 

g 



NtADCl 

agcccccatctattacaaccattgggcaaaaacatca 
ttaaatctgtacaaagcaaacccttaatttagtttaa 
ttttct 398 
NtADC2 

t 



399 



MPALGCCVDAT- 
V S P P 
NtADCl 

gtattctttgattctttaacagaagaagaagagATGC 

CG6CCCTAGGTTGTTGCGTAGACGCTACT 

GTTTCCCCTCC 475 
NtADC2 

a t ATG. 

T T...G.. GTT 

479 

1 

M*********AV 
* * * * 

16LGYAFSRDSS 
LPAPEFFTSGVPP 
T N S A 
NtADCl 

CTCGGCTATGCCTTCTCTCGGGATAGCTCTCTTCCCG 
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CGCCGGAGTTCTTTACCTCCGGCGTACCTCCTACAAA 

CTCCG 555 

NtADC2 

. . .A 

G ..G 

...T. 559 

17 ♦S******** 

* * * * 

43 AGSIGSPDLS 
SALYGVDGWGAPY 
F S V 
NtADCl 

CCGCCGGTTCCATTGGGTCTCCGGATCTGTCCTCTGC 
TTTGTACGGGGTCGATGGGTGGGGAGCTCCTTATTTC 
TCCGTT 635 
NtADC2 

. . . .T.C T. . . .G 

. . .A 

..T... 639 

************* 

* * * 

69NSNGDISVRP 
HGTDTLPHQEIDL 
L K V V 
NtADCl 

AACTCTAACGGAGATATCTCCGTCCGACCACATGGTA 
CGGACACACTCCCCCACaGGAAATTGACCTTCTCAA 
GGTCGT 715 



PCT/USOO/12450 

21 

NtADC2 

T C 

T T 

719 

70********** 
************* 

* * * * 

96KKASDPKNSG 
GLGLQLPLVVRFP 
D V L K 
NtADCl 

GAAAAAGGCCTCCGACCCGAAAAATTCAGGGGGGCTC 

TGCTAA 795 
NtADC2 

T T 

..T.G. 799 

97********** 

* * * * 

123 N R L E S L Q S A F 
DLAVHSQGYGAHY 
Q G V 
NtADCl 

AAAACCGGTTGGAATCTCTGCAATCGGCTTTTGATCT 
CGCTGTTCATTCCCAGGGCTATGGGGCCCACTACCAA 
GGTGTT 875 
NtADC2 
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G 

. . . 879 

124 ********** 



* * * 

149 YPVKCNQDRF 
VVEDIVKFGSSFR 
F G L E 
NtADCl 

TATCCCGTGAAATGCAATCAAGACAGGTTCGTGGTGG 
AAGATATTGTCAAATTCGGGTCGTCATTCCGGTTCGG 
GTTGGA 955 
NtADC2 



C C 

959 

XSQ ********** 
*********«p«* 
* * .* * 

176 AGSKPELLLA 
MSCLCRGSAEGLL 
V C N G 
NtADCl 

AGCTGGGTCTAAACCCGAGCTCCTGTTAGCCATGAGC 
TGTCTCTGCAGGGGCAGTGCTGAGGGCCTTCTCGTTT 
GCAATG 1035 
NtADC2 

...C 

A 

1039 



177 ********** 
*****K***, 
* * * * 

203 FKDAEYISLA 
LVARKLMLNTVIV 
L E Q 
NtADCl 

GTTTCAAGGACGCTGAGTACATTTCGCTTGCTTTGGT 
TGCAAGAAAGCTCATGTTAAACACTGTAATTGTTCTT 
GAACAA 1115 
NtADC2 



G. . . 

1119 

204 ********** 
************* 

* * * 

229 EEELDLVIDI 
SRKMAVRPVIGLR 
A K L R 

NtADCl 

GAGGAGGAGCTTGACCTTGTGATTGATATAAGCCGTA 
AGATGGCTGTTCGGCCCGTAATTGGACTTCGGGCTAA 
GCTCAG 1195 
NtADC2 

A. . 

T 

1199 

230 ********** 
*}{*********** 

* * * * 



F/(ya/c7 (c) 



WOQ067558 [file:/A\dcwas03\firindata\lp\FotevPat\PatentDocuments\WO0067558.CPC] 



Page 53 of 103 



WO 00/67558 PCTAJSOO/12450 

10 / 21 



256 TKHSGHFGST 
SGEKGKFGLTTTQ 
I V R V 
NtADCl 

GACCAAGCATTCAGGCCATTTTGGATCCACTTCTGGA 
GAAAAAGGTAAGTTTGGGCTTACAACGACCCAAATTG 
TTCGTG 1275 
NtADC2 



1279 

257 **♦*♦♦*♦♦* 

* * * * 

283 VKKLEESGML 
DCLQLLHFHIGSQ 
IPS 

NtADCl 

TAGTGAAGAAGCTGGAAGAATCCGGAATGCTGGATTG 
CCTTCAGTTGCTGCATTTTCACATTGGATCTCAGATC 
CCTTCA 1355 
NtADC2 

.G A 

T 

T 1359 

284 ♦♦*♦♦♦**♦* 

* * * 



309 TALLADGVGE 
AAQIYCELI RLGA 
G M K F 

NtADCl 

ACGGCGTTGCTTGCTGATGGTGTTGGTGAGGCTGCTC 
AGATTTATTGTGAATTAATCCGTCTTGGTGCGGGTAT 
GAAGTT 1435 
NtADC2 

G A A. C 

G A 

1439 

310 *G******** 

* * * * 



336 IDTGGGLGID 
YDGTKSCDSDVSV 
G Y G I 
NtADCl 

CATTGATACTGGAGGTGGGCTCGGAATTGATTATGAT 
GGTACTAAATCATGTGATTCAGATGTCTCTGTTGGCT 
ATGGCA 1515 
NtADC2 

T T 

C T 

1519 

337 

************* 
* * * * 
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363 QEYASTVVQA 
VQYVCDRKGVKHP 
VIC 
NtADCl 

TTCAAGAATACGCCTCCACAGTTGTCCAGGCGGTTCA 
ATATGTTTGCGACCGTAAGGGCGTGAAGCACCCAGTG 
ATTTGC 1595 
NtADC2 

T G T 

A T A 

..C... 1599 

[Figure, continued: alignment of the NtADC1 and NtADC2 nucleotide sequences with the deduced amino acid sequence, covering amino acid residues 389 to 720 and ending at nucleotide 2678 (NtADC1) and 2682 (NtADC2).]

[Figure, sheet 18/21: alignment of the ODC1 and ODC2 nucleotide sequences with the deduced amino acid sequences pODC1 and pODC2; annotated features include the TATA box, the poly(A) signal, and the poly(A) site.]
[Figure: amino acid sequence alignment of N. tabacum cv Xanthi, N. tabacum cv BY2, N. tabacum cv SC58, D. stramonium, L. esculentum, S. cerevisiae, and H. sapiens, shown in blocks of 50 alignment positions through position 533.]
[Figure, panel B: ODC, 7B6-1, PMT, and 28S rRNA signals in Low Alkaloid and Wild-type samples at 0, 4, 12, and 24 hr.]





SEQUENCE LISTING 

<110> Timko, Michael P.

<120> Regulation of Gene Expression in Tobacco for
Manipulation of Plant Growth and Secondary Metabolism

<130> 4981*239 

<140> 
<141> 

<160> 26 

<170> PatentIn Ver. 2.0

<210> 1 

<211> 1120 

<212> DNA 

<213> Plant 

<400> 1 

ctgagttgac aagaacaatt cctggtgaat cagatggatg aagataatag aggtgggtgg 60 
aatctataac caaagcagct ggttgagtga ctgtgcgagt tgcagaaaca attgaagggt 120 
catttgtgga atttggggcc atttcaaagg aaaaagaaaa gatgacttag cattaataaa 180 
tcaaattaaa ataaggctta gcgttaaaat caaaggaaat ggcaagcctg gctcctggag 240 
caatgcttct gaggacagta gtaaaaacaa tatcagacaa aaagtaaagt tgtattattt 300 
agcttgagga taaagtatgt cattagtttt gtgagagatt tggtgtcctc tacaatgatt 360 
gttgaagtcc ctatttatag ctatacacag gaaacaaaat cctaggatca agcccctctt 420 
aaatgacaat aatggggtta atgatgaata tgtagcggca tgacatgaat gccaaaattc 480 
tccgcaacga ctatttattt aatattgagg aatatttttt attaaatact atctggtgac 540 
aagcattcgt ttgcttccgt tgattacgtt gattttggga tctactccat accaaccgaa 600 
gccgttgtcc ttgatcttcg ctttcattta attcatcttc cgtctgcctc cgatttcaca 660 
agtcatgcac ccattcaatt atttaatgga aaccaatttt accctataca aatggtacat 720 
cattcgtcaa atactttact tggatataaa caattttgcc cgaggagtaa acagatgcga 780 
agaaagaaag cagacgatta aagaaatttt taaaaaagga gagagaaatg aacacacaca 840 
tgtactaata aaattagggt actactttac taataattgg acagagacta aattcatatt 900 
ttagttccaa aatgtctcgg gcagtccaac catgcacgtt gtaatgattt tttaactcta 960 
ttatatcgag ttgcgccctc cactcctcgg tgtccaaatt gtatataaat gcatatgtgt 1020 
ctattgggag tgtacatcaa gctttcataa agtacaaatc gtaatacttg ttgaaacata 1080 
atactttctc ttctccaatt tgtttagttt aattttgaaa 1120 

<210> 2 

<211> 3091 

<212> DNA 

<213> Plant 

<400> 2 



ctgagttgac aagaacaatt cctggtgaat cagatggatg aagataatag aggtgggtgg 60
aatctataac caaagcagct ggttgagtga ctgtgcgagt tgcagaaaca attgaagggt 120
catttgtgga atttggggcc atttcaaagg aaaaagaaaa gatgacttag cattaataaa 180
tcaaattaaa ataaggctta gcgttaaaat caaaggaaat ggcaagcctg gctcctggag 240
caatgcttct gaggacagta gtaaaaacaa tatcagacaa aaagtaaagt tgtattattt 300
agcttgagga taaagtatgt cattagtttt gtgagagatt tggtgtcctc tacaatgatt 360
gttgaagtcc ctatttatag ctatacacag gaaacaaaat cctaggatca agcccctctt 420
aaatgacaat aatggggtta atgatgaata tgtagcggca tgacatgaat gccaaaattc 480
tccgcaacga ctatttattt aatattgagg aatatttttt attaaatact atctggtgac 540
aagcattcgt ttgcttccgt tgattacgtt gattttggga tctactctat accaaccgaa 600
gccgttgtcc ttgatcttcg ctttcattta attcatcttc cgtctgcctc cgatttcaca 660
agtcatgcac ccattcaatt atttaatgga aaccaatttt accctataca aatggtacat 720
cattcgtcaa atactttact tggatataaa caattttgcc cgaggagtaa acagatgcga 780
agaaagaaag cagacgatta aagaaatttt taaaaaagga gagagaaatg aacacacaca 840
tgtactaata aaattagggt actactttac taataattgg acagagacta aattcatatt 900
ttagttccaa aatgtctcgg gcagtccaac catgcacgtt gtaatgattt tttaactcta 960
ttatatcgag ttgcgccctc cactcctcgg tgtccaaatt gtatataaat gcatatgtgt 1020
ctattgggag tgtacatcaa gctttcataa agtacaaatc gtaatacttg ttgaaacata 1080
atactttctc ttctccaatt tgtttagttt aattttgaaa atggaagtca tatctaccaa 1140
cacaaatggc tctaccatct tcaagagtgg tgccattccc atgaatggcc accataatgg 1200
cacttccaaa caccaaaacg gccacaagaa tgggacttcc gaacaacaga acgggacaat 1260
cagccttgat aatggcaacg agctactggg aaactccaat tgtattaagc ctggttggtt 1320
ttcagagttt agcgcattat ggccaggtta gtactgagaa agaaactcaa atgcatattt 1380
aaagttaaaa ttgttaggct aatataagga gttgatattc ttttagtgat taattaaaaa 1440
ggaaaaagta tcaaataaat tcaaaaaatg gatagtaact tcgcatatta ctctacacat 1500
taatttgaaa taaatcgaat tttgcaggtg aagcattctc acttaaggtt gagaagttac 1560
tgttccaggg gaagtctgac taccaagatg tcatgctctt tgaggtaaat aatattttaa 1620
tacacatgct tccatttaaa ttgatacttt taatttactt ttactttatt gcatgtgtac 1680
gtacagtcag caacttatgg gaaggttctg actttggatg gagcaattca acacacagag 1740
aatggtggat ttccatacac tgaaatgatt gttcatcttc cacttggttc catcccaaac 1800
ccaaaaaagg ttttgatcat cggcggagga attggtttta cattattcga aatgcttcgt 1860
tatcctacaa tcgaaaaaat tgacattgtt gagatcgatg acgtggtagt tgatgtaagt 1920
caaacttctt ttactcacat aaaaaaatgg tttagattgc ttcttgttat ttttctaaaa 1980
gaatactatt tttttaaaac aaaattttct tttttacagg tatctagaaa atttttccct 2040
tatctcgctg ctaattttaa cgatcctcgt gtaaccctag tccttggaga tggtgcgtat 2100
ttgataatct cgcttttgtt ttatctttta tttttattgc atttaatttt taccttttgg 2160
tgtgtggtta attcacctgc cattggttct ctttcatttc aggggctgca tttgtaaagg 2220
ctgcacaagc agaatattat gatgctatta tagtggactc ttctgatccc attggtactc 2280
tattacttct taataccaag actaatctta ttgaataagc tactaataaa cggtaattga 2340
tttctaaaac aatataattt caggtccagc aaaagatttg tttgagaggc cattctttga 2400
ggcagtagct aaagccctaa ggccaggagg agttgtatgc acacaggctg aaagcatttg 2460
gcttcatatg catattatta agcaaatcat tgctaactgt cgtcaagtct ttaagggctc 2520
tgtcaactat gcttggacta ctgttccaac atatccaacg tatttttctc tctctctctc 2580
ttcctataaa attggaagtt ttgattctat aattgtcaag aaatggagaa tcagttccaa 2640
gaaaaaccaa cttcttttct tttactcttc aaggtattgt gtttaatttt ttttcaactg 2700
atatgatcaa ttattttgat ttcagcggtg tgattggtta tatgctctgc tctactgaag 2760
gaccagaaat tgacttcaag aatccagtaa atccaattga caaagagaca gctcaagtca 2820
agtccaaatt agcacctctc aagttctaca actctgatgt aacttcatat ctcacaattt 2880
cttttttcct attgtacttt atgttcttcg tcaaatttta taattaactc ttttcaaatt 2940
gtcttttttt ttttcagatt cacaaagcag cattcatttt gccatctttc gccagaagta 3000
tgatcgagtc ttaatcaact gattaatgaa tactggtggt acaatcattg gaccaagatc 3060
aataagtgaa agacgtattg tatgagaatt c 3091

<210> 3 

<211> 353 

<212> PRT 

<213> Plant 

<400> 3 

Met Glu Val Ile Ser Thr Asn Thr Asn Gly Ser Thr Ile Phe Lys Ser
1 5 10 15

Gly Ala Ile Pro Met Asn Gly His His Asn Gly Thr Ser Lys His Gln
20 25 30

Asn Gly His Lys Asn Gly Thr Ser Glu Gln Gln Asn Gly Thr Ile Ser
35 40 45

Leu Asp Asn Gly Asn Glu Leu Leu Gly Asn Ser Asn Cys Ile Lys Pro
50 55 60

Gly Trp Phe Ser Glu Phe Ser Ala Leu Trp Pro Gly Glu Ala Phe Ser
65 70 75 80

Leu Lys Val Glu Lys Leu Leu Phe Gln Gly Lys Ser Asp Tyr Gln Asp
85 90 95

Val Met Leu Phe Glu Ser Ala Thr Tyr Gly Lys Val Leu Thr Leu Asp
100 105 110

Gly Ala Ile Gln His Thr Glu Asn Gly Gly Phe Pro Tyr Thr Glu Met
115 120 125

Ile Val His Leu Pro Leu Gly Ser Ile Pro Asn Pro Lys Lys Val Leu
130 135 140

Ile Ile Gly Gly Gly Ile Gly Phe Thr Leu Phe Glu Met Leu Arg Tyr
145 150 155 160

Pro Thr Ile Glu Lys Ile Asp Ile Val Glu Ile Asp Asp Val Val Val
165 170 175

Asp Val Ser Arg Lys Phe Phe Pro Tyr Leu Ala Ala Asn Phe Asn Asp
180 185 190

Pro Arg Val Thr Leu Val Leu Gly Asp Gly Ala Ala Phe Val Lys Ala
195 200 205

Ala Gln Ala Glu Tyr Tyr Asp Ala Ile Ile Val Asp Ser Ser Asp Pro
210 215 220

Ile Gly Pro Ala Lys Asp Leu Phe Glu Arg Pro Phe Phe Glu Ala Val
225 230 235 240

Ala Lys Ala Leu Arg Pro Gly Gly Val Val Cys Thr Gln Ala Glu Ser
245 250 255

Ile Trp Leu His Met His Ile Ile Lys Gln Ile Ile Ala Asn Cys Arg
260 265 270

Gln Val Phe Lys Gly Ser Val Asn Tyr Ala Trp Thr Thr Val Pro Thr
275 280 285

Tyr Pro Thr Gly Val Ile Gly Tyr Met Leu Cys Ser Thr Glu Gly Pro
290 295 300

Glu Ile Asp Phe Lys Asn Pro Val Asn Pro Ile Asp Lys Glu Thr Ala
305 310 315 320

Gln Val Lys Ser Lys Leu Ala Pro Leu Lys Phe Tyr Asn Ser Asp Ile
325 330 335

His Lys Ala Ala Phe Ile Leu Pro Ser Phe Ala Arg Ser Met Ile Glu
340 345 350

Ser



<210> 4 
<211> 711 
<212> DNA 
<213> Plant 

<400> 4 

gaattcaatg gagaaggaaa atatttccag tgtaaacaca agtgaatgaa gagaagccaa 60
aataatctct atcattcaag ccttaggtgg agattaaaaa aattatttac tttcttatca 120
aagtaatagg tgatcaacag ctttcgtaaa acgtcattag gagaatatta taatctcttt 180
tatgctgaag aacccacata aggaagatca taaaatacat gactttcaga tgacttcttg 240
gagctttatt tttaaagagt ggctagctgg tcagcaaaga ggtgctcgtc agatatcata 300
aaattttact attatttgtt ttaagaggga gatggggcac acatgcttgt gacaaaagta 360
agaggaagaa aggagacaga agaggaaata gatttggggg gggggggggg ggtttcacaa 420
tcaaagaaaa tttttaaaat ggagagagaa atgagcacac acatatacta acaaaatttt 480
actaataatt gcaccgagac aaacttatat tttagttcca aaatgtcagt ctaaccctgc 540
acgttgtaat gaatttttaa ctattatatt atatcgagtt gcgccctcca ctcctcggtg 600
tccaaattgt atttaaatgc atagatgttt attgggagtg tacagcaagc tttcggaaaa 660
tacaaaccat aatactttct cttcttcaat ttgtttagtt taattttgaa a 711

<210> 5 
<211> 3129 
<212> DNA 
<213> Plant 

<400> 5 

gaattcaatg gagaaggaaa atatttccag tgtaaacaca agtgaatgaa gagaagccaa 60 

aataatctct atcattcaag ccttaggtgg agattaaaaa aattatttac tttcttatca 120 
aagtaatagg tgatcaacag ctttcgtaaa acgtcattag gagaatatta taatctcttt 180 
tatgctgaag aacccacata aggaagatca taaaatacat gactttcaga tgacttcttg 240 
gagctttatt tttaaagagt ggctagctgg tcagcaaaga ggtgctcgtc agatatcata 300 
aaattttact attatttgtt ttaagaggga gatggggcac acatgcttgt gacaaaagta 360 
agaggaagaa aggagacaga agaggaaata gatttggggg gggggggggg ggtttcacaa 420 
tcaaagaaaa tttttaaaat ggagagagaa atgagcacac acatatacta acaaaatttt 480 
actaataatt gcaccgagac aaacttatat tttagttcca aaatgtcagt ctaaccctgc 540 
acgttgtaat gaatttttaa ctattatatt atatcgagtt gcgccctcca ctcctcggtg 600 
tccaaattgt atttaaatgc atagatgttt attgggagtg tacagcaagc tttcggaaaa 660 
tacaaaccat aatactttct cttcttcaat ttgtttagtt taattttgaa aatggaagtc 720 
atatctacca acacaaatgg ctctaccatc ttcaagaatg gtgccattcc catgaacggc 780 
caccaaaatg gcacttctga acacctcaac ggctaccaga atggcacttc caaacaccaa 840 
aacgggcacc agaatggcac tttcgaacat cggaacggcc accagaatgg gacatccgaa 900 
caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaag ctccgactct 960 
attaagcctg gctggttttc agagtttagc gcattatggc caggttagta ctaagaaagc 1020 
aactcaaatg catcggcctc ttgttgctac taaatataga gagctatcat acttttaggg 1080 
actaactaaa aaggaaagat tatcacaggg acgaagtgag cagttaactt cgcatattat 1140 
cagacgcatt aatttgaaat aatcgaattt tgcaggtgaa gcattctcac ttaaggttga 1200 
gaagttacta ttccagggga agtctgatta ccaagatgtc atgctctttg aggtaattaa 1260 
tattctaata cacatgcttt aatttaaagt gatactttta atttactttt agtttattgc 1320 
atgtgcacgt acagtcagca acttatggga aggttctgac tttggatgga gcaattcaac 1380 
atacagagaa tggtggattt ccatacactg aaatgattgt tcatctacca cttggttcca 1440 
tcccaaaccc aaaaaaggtt ttgatcatcg gcggaggaat tggttttaca ttattcgaaa 1500 
tgcttcgtta tccttcaatc gaaaaaattg acattgttga gatcgatgac gtggtagttg 1560 
atgtaagtca aacttctttt acccacataa agaaaatgat ttagattgca attcttttta 1620 
tttttctaaa agaataaata tattctcttt ttttttttta aaacaaaatt ctctttctta 1680 
caggtatcca gaaaattttt cccttatctg gcagctaatt ttaacgatcc tcgtgtaacc 1740 
ctagttctcg gagatggtgc gtatatgata gtctcgtttt atattttatt tcacttgatt 1800 
tttacctttt tttgtggtta attaatcatc taccattggt tctctttacc ttcaggagct 1860 
gcatttgtaa aggctgcaca agcgggatat tatgatgcta ttatagtgga ctcttctgat 1920 
cccattggta cgctattact atttaatacc aagactattc ttattaaata agctactaag 1980 
aaactaattg aataattaat aaacgtaact gtaattgatt tctaaaataa tatatataat 2040 
ttcaggtcca gcaaaagatt tgtttgagag gccattcttt gaggcagtag ccaaagccct 2100 
taggccagga ggagttgtat gcacacaggc tgaaagcatt tggcttcata tgcatattat 2160 
taagcaaatc attgctaact gtcgtcaagt ctttaagggt tctgtcaact atgcttggac 2220 
aaccgttcca acatatccca cgtattcttt ttctctctct ctcttcctgt ctttttcgat 2280 
gcaatgtaaa tttataaaat tggaagtccg ttttactttt ctatagacgt agatcctaaa 2340
attgtcaaga aatggagaat tgacttacaa gaaaaatcaa cttcttttca tttactattc 2400
tttttggtga caaactttac ttattatttc gttctaaaat gaaaatttat ttttatattt 2460
taaaataatt tagctttaaa cttttaattt tacttgttat atttttaata aaaaagattt 2520
atagtcaaat aaatgttgtg accatataaa aacctccgca tttttaagat cataagtttc 2580
agagtcaaac gagttaattt atttttagta tgccggtgcg gagtcaaatt atgtcataaa 2640
aattgaaacg gagtgagaac atttttattt cgagtaaact ttcaaggtat tgtgtttaat 2700
ttcaagtgat actgatcaat gatgtcttaa atattttgat ttcagcggtg tgatcggtta 2760
tatgctctgc tctactgaag ggccagaagt tgacttcaag aatccagtaa atccaattga 2820
caaagagaca actcaagtca agtccaaatt aggacctctc aagttctaca actctgatgt 2880
aacttcatat ctcacaattt ctttttccgt tttactgtat gttcttcgtc aaattttata 2940
actaactctt ttcatattgt cttttttttc agattcacaa agcagcattc attttaccat 3000
ctttcgccag aagtatgatc gagtcttaat caagtgaata atgaacactg gtagtacaat 3060
cattggacca agatcgagtc ttaatcaagt gaataaataa gtgaaatgcg acgtattgta 3120
ggagaattc 3129

<210> 6
<211> 375
<212> PRT
<213> Plant

<400> 6

Met Glu Val Ile Ser Thr Asn Thr Asn Gly Ser Thr Ile Phe Lys Asn
1 5 10 15

Gly Ala Ile Pro Met Asn Gly His Gln Asn Gly Thr Ser Glu His Leu
20 25 30

Asn Gly Tyr Gln Asn Gly Thr Ser Lys His Gln Asn Gly His Gln Asn
35 40 45

Gly Thr Phe Glu His Arg Asn Gly His Gln Asn Gly Thr Ser Glu Gln
50 55 60

Gln Asn Gly Thr Ile Ser His Asp Asn Gly Asn Glu Leu Leu Gly Ser
65 70 75 80

Ser Asp Ser Ile Lys Pro Gly Trp Phe Ser Glu Phe Ser Ala Leu Trp
85 90 95

Pro Gly Glu Ala Phe Ser Leu Lys Val Glu Lys Leu Leu Phe Gln Gly
100 105 110

Lys Ser Asp Tyr Gln Asp Val Met Leu Phe Glu Ser Ala Thr Tyr Gly
115 120 125

Lys Val Leu Thr Leu Asp Gly Ala Ile Gln His Thr Glu Asn Gly Gly
130 135 140

Phe Pro Tyr Thr Glu Met Ile Val His Leu Pro Leu Gly Ser Ile Pro
145 150 155 160

Asn Pro Lys Lys Val Leu Ile Ile Gly Gly Gly Ile Gly Phe Thr Leu
165 170 175

Phe Glu Met Leu Arg Tyr Pro Ser Ile Glu Lys Ile Asp Ile Val Glu
180 185 190

Ile Asp Asp Val Val Val Asp Val Ser Arg Lys Phe Phe Pro Tyr Leu
195 200 205

Ala Ala Asn Phe Asn Asp Pro Arg Val Thr Leu Val Leu Gly Asp Gly
210 215 220

Ala Ala Phe Val Lys Ala Ala Gln Ala Gly Tyr Tyr Asp Ala Ile Ile
225 230 235 240

Val Asp Ser Ser Asp Pro Ile Gly Pro Ala Lys Asp Leu Phe Glu Arg
245 250 255

Pro Phe Phe Glu Ala Val Ala Lys Ala Leu Arg Pro Gly Gly Val Val
260 265 270

Cys Thr Gln Ala Glu Ser Ile Trp Leu His Met His Ile Ile Lys Gln
275 280 285

Ile Ile Ala Asn Cys Arg Gln Val Phe Lys Gly Ser Val Asn Tyr Ala
290 295 300

Trp Thr Thr Val Pro Thr Tyr Pro Thr Gly Val Ile Gly Tyr Met Leu
305 310 315 320

Cys Ser Thr Glu Gly Pro Glu Val Asp Phe Lys Asn Pro Val Asn Pro
325 330 335

Ile Asp Lys Glu Thr Thr Gln Val Lys Ser Lys Leu Gly Pro Leu Lys
340 345 350

Phe Tyr Asn Ser Asp Ile His Lys Ala Ala Phe Ile Leu Pro Ser Phe
355 360 365

Ala Arg Ser Met Ile Glu Ser
370 375



<210> 7 
<211> 1134
<212> DNA 
<213> Plant 



<400> 7 

gctgtacaaa aggatgtctc aaatcatttg gaatattaat tctgcaatca acaagaaata 60
ccccactatt aagacccatt atcactggca caaaaattat gagatcatta aacatcttaa 120
acctgtccct atttggaaga gtgtggtatg ggagatgcct cccagggagt acctaaagct 180
gaatactgat ggaagtttta acaaacaaat tgggaaagca gggattggag ggattctcag 240
agatgaagag ggaggctttg tcatggcttt ttcgatgcct ataatctata ataacatcag 300
tgaagcagaa ttgaaagcca tcaagtatgg gtgtgaatgg tgcaaataca aaggaatatc 360
aaacttcatt gtggaaactg actcgaggat gatctatgac atactacaga ccaaaaatct 420
aagcaacaac aagttgaaac aagagaccga gaaattaatg gagattctgg acacctgcag 480
gacacctgtt acccattgcc ttcgcgaagc aaatcaagtg gcagactggt ttgctaaaga 540
ggccaccaga gctaacgaag gtatcactca tacagatttt agacaggtat caaaagcggc 600
caagggccct ttcttcatgg atatgtggca ggtcccttat tttagaatta gatatgaaaa 660
atctaatttt tttttgtaag ttaattctgt gtatagtgag aggaaatcgt ctaatatgta 720
tttttgccca tagactcttc ctctccttag gtaaaaaggt agctccgagg taaggtttat 780
gttcccctca gtgtaacctt tttttgttta tataatagac atggtatggg tccagctaaa 840
cccccaacac cacaggggat agatacctgg gtgattggtt tattttttaa aaaaaaaaac 900
tttactaata attgcacgga gacaaaactt atattttagt tccaaaatga cagtccaacc 960
atgcacgttg taatgatttt ttaactctat tatatcgagt tccgccctcc actcctcggt 1020
gtccaaattg tatttaaatg catagatatg tttattggga gtgtacatca agctttcaga 1080
aaatacaaac cataatactt tctcttctcc aatttgctta gtttaatttg gaaa 1134



<210> 8 
<211> 3269 
<212> DNA 
<213> Plant 



<400> 8 

gctgtacaaa aggatgtctc aaatcatttg gaatattaat tctgcaatca acaagaaata 60 

ccccactatt aagacccatt atcactggca caaaaattat gagatcatta aacatcttaa 120 

acctgtccct atttggaaga gtgtggtatg ggagatgcct cccagggagt acctaaagct 180 

gaatactgat ggaagtttta acaaacaaat tgggaaagca gggattggag ggattctcag 240

agatgaagag ggaggctttg tcatggcttt ttcgatgcct ataatctata ataacatcag 300 

tgaagcagaa ttgaaagcca tcaagtatgg gtgtgaatgg tgcaaataca aaggaatatc 360 

aaacttcatt gtggaaactg actcgaggat gatctatgac atactacaga ccaaaaatct 420 

aagcaacaac aagttgaaac aagagaccga gaaattaatg gagattctgg acacctgcag 480 

gacacctgtt acccattgcc ttcgcgaagc aaatcaagtg gcagactggt ttgctaaaga 540 

ggccaccaga gctaacgaag gtatcactca tacagatttt agacaggtat caaaagcggc 600 

caagggccct ttcttcatgg atatgtggca ggtcccttat tttagaatta gatatgaaaa 660 

atctaatttt tttttgtaag ttaattctgt gtatagtgag aggaaatcgt ctaatatgta 720

tttttgccca tagactcttc ctctccttag gtaaaaaggt agctccgagg taaggtttat 780 

gttcccctca gtgtaacctt tttttgttta tataatagac atggtatggg tccagctaaa 840 

cccccaacac cacaggggat agatacctgg gtgattggtt tattttttaa aaaaaaaaac 900 

tttactaata attgcacgga gacaaaactt atattttagt tccaaaatga cagtccaacc 960 

atgcacgttg taatgatttt ttaactctat tatatcgagt tccgccctcc actcctcggt 1020 




gtccaaattg tatttaaatg catagatatg tttattggga gtgtacatca agctttcaga 1080 
aaatacaaac cataatactt tctcttctcc aatttgctta gtttaatttg gaaaatggaa 1140 
gtcatatcta ccaacacaaa tggctctact atcttcaaga atggtgccat tcccatgaac 1200 
ggttaccaga atggcacttc caaacaccaa aacggccacc agaatggcac ttccgaacat 1260 
cggaacggcc accagaatgg gatttccgaa caccaaaacg gccaccagaa tggcacttcc 1320 
gagcatcaga acggccatca gaatgggaca atcagccatg acaacggcaa cgagctacag 1380 
ctactgggaa gctccaactc tattaagcct ggttggtttt cagagtttag cgcattatgg 1440 
ccaggttagt actaagaaag aaactcaaat gcatcgtact cttgtattct gctttgcgta 1500 
taatttagat gatggtgttt gactaagcac tgagtttaaa aataaaaagt ttaaagttaa 1560 
attgttacta tagagagcta tatctttagg aactaactaa aaaggaaaaa ttatcacata 1620 
aaattgggat gaagtaagca gttaacttcg catattattc gacacattaa tttgaaataa 1680 
atcgaatttt gcaggtgaag cattctcact taaggttgag aagttactat tccaggggaa 1740 
gtctgattac caagatgtca tgctctttga ggtaattaat taatactaat agtcaagctc 1800 
atgtatgatt atatttaaag tggtattttt cgtttatttt taatttattg cacgtgtacg 1860 
tacagtcagc aacatatggg aaggttctga ctttggatgg agcaattcaa cacacagaga 1920 
atggtggatt tccatacact gaaatgattg ttcatcttcc acttggttcc atcccaaacc 1980 
ctaaaaaggt tttgatcatc ggcggaggaa ttggttttac attattcgaa atgcttcgtt 2040 
atcctacaat cgaaaaaatt gacattgttg agatcgatga cgtggtagtt gatgtaagtc 2100 
aaacttcttt tactcacata aaaaaatgat ttagattctt atttttctaa aagaattaaa 2160 
acaaaatttt ccgttttaca ggtatctaga aaatttttcc cttatcttgc tgctaatttt 2220 
agcgatcctc gtgtaaccct agtccttgga gatggtgcgt atttgataat ctcgttttta 2280 
ttttatcttt tacttttatt ttatttaatt tttacctttt tgtgtgtggt taattcacct 2340 
gccattggtt ctttttattt caggggctgc atttgtaaag gccgcacaag caggatatta 2400 
tgatgctatt atagtggact cttctgatcc cattggtact ctattactac ttaataccaa 2460 
gactattctt attaaataag ctactaataa acgtaactct gatagttttc taaaataata 2520 
taatttcagg tccagcaaaa gacttgtttg agaggccatt ctttgaggca gtagccaaag 2580 
ccctaaggcc aggaggagtt gtatgcacac aggctgaaag catttggctt catatgcata 2640 
ttattaagca aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactatgctt 2700 
ggactactgt tccaacatat ccaacgtatt tttctctctc tcttcctata aaattggaag 2760 
ttttgattct ataattgtca agaaatggag aatcagttcc aagaaaaacc aaattctttt 2820 
cttttactct tcaaggtgtg tttaagtttt ttaaactgat actgatcaat tattttgatt 2880 
tcagcggtgt gattggttat atgctctgtt ctactgaagg accagaagtt gacttcaaga 2940 
atccagtaaa tccaattgac aaagagacaa ctcaagtcaa gtccaaatta gcacctctca 3000 
agttctacaa ctctgatgta acttcatatc tcaatttctt ttttcttatt gtactttatg 3060 
ttcttagtca aattttataa ttaactcttt tcaaattgtc tttttttttc agattcacaa 3120 
agcagcattc attttgccat ctttcgccag aagtatgatc gagtcttaat caagtgacta 3180 
atgaatactg gcggtacaat cattggacca agatcgagtc ttaatcaagt gaataaataa 3240 
gtgaaatgcg acgtattgta taagaattc 3269 

<210> 9 

<211> 381 

<212> PRT 

<213> Plant 

<400> 9 

Met Glu Val Ile Ser Thr Asn Thr Asn Gly Ser Thr Ile Phe Lys Asn
1 5 10 15

Gly Ala Ile Pro Met Asn Gly Tyr Gln Asn Gly Thr Ser Lys His Gln
20 25 30

Asn Gly His Gln Asn Gly Thr Ser Glu His Arg Asn Gly His Gln Asn
35 40 45

Gly Ile Ser Glu His Gln Asn Gly His Gln Asn Gly Thr Ser Glu His
50 55 60

Gln Asn Gly His Gln Asn Gly Thr Ile Ser His Asp Asn Gly Asn Glu
65 70 75 80

Leu Gln Leu Leu Gly Ser Ser Asn Ser Ile Lys Pro Gly Trp Phe Ser
85 90 95

Glu Phe Ser Ala Leu Trp Pro Gly Glu Ala Phe Ser Leu Lys Val Glu
100 105 110

Lys Leu Leu Phe Gln Gly Lys Ser Asp Tyr Gln Asp Val Met Leu Phe
115 120 125

Glu Ser Ala Thr Tyr Gly Lys Val Leu Thr Leu Asp Gly Ala Ile Gln
130 135 140

His Thr Glu Asn Gly Gly Phe Pro Tyr Thr Glu Met Ile Val His Leu
145 150 155 160

Pro Leu Gly Ser Ile Pro Asn Pro Lys Lys Val Leu Ile Ile Gly Gly
165 170 175

Gly Ile Gly Phe Thr Leu Phe Glu Met Leu Arg Tyr Pro Thr Ile Glu
180 185 190

Lys Ile Asp Ile Val Glu Ile Asp Asp Val Val Val Asp Val Ser Arg
195 200 205

Lys Phe Phe Pro Tyr Leu Ala Ala Asn Phe Ser Asp Pro Arg Val Thr
210 215 220

Leu Val Leu Gly Asp Gly Ala Ala Phe Val Lys Ala Ala Gln Ala Gly
225 230 235 240

Tyr Tyr Asp Ala Ile Ile Val Asp Ser Ser Asp Pro Ile Gly Pro Ala
245 250 255

Lys Asp Leu Phe Glu Arg Pro Phe Phe Glu Ala Val Ala Lys Ala Leu
260 265 270

Arg Pro Gly Gly Val Val Cys Thr Gln Ala Glu Ser Ile Trp Leu His
275 280 285

Met His Ile Ile Lys Gln Ile Ile Ala Asn Cys Arg Gln Val Phe Lys
290 295 300

Gly Ser Val Asn Tyr Ala Trp Thr Thr Val Pro Thr Tyr Pro Thr Gly
305 310 315 320

Val Ile Gly Tyr Met Leu Cys Ser Thr Glu Gly Pro Glu Val Asp Phe
325 330 335

Lys Asn Pro Val Asn Pro Ile Asp Lys Glu Thr Thr Gln Val Lys Ser
340 345 350

Lys Leu Ala Pro Leu Lys Phe Tyr Asn Ser Asp Ile His Lys Ala Ala
355 360 365

Phe Ile Leu Pro Ser Phe Ala Arg Ser Met Ile Glu Ser
370 375 380



<210> 10 
<211> 469 
<212> DNA 
<213> Plant 

<400> 10 

gtcgacctct gattccacaa gtcatgcacc cattcaatta tttaatggaa accaatttta 60 

ccctgtacaa atggtacaaa tactttcctt ggataaaaac aattttgcct aaggagtaaa 120 

cagatgcgaa gtaagaaagc agacgactaa agaaaatttt aaaaaaggag agagaaatga 180 

gcacacacac gtactaataa aattagggta ctactttact aataattgga cagagactaa 240 

attcatattt tagttccaaa atgtctcggg cagtccaacc atgcacgttg taatgagttt 300 

ttaactctat tatctcgagt tgcgccctcc actcctctgt gtccaagttg tatataaatg 360

catatatgtc tattgggagt gtacagcgag ctttcataaa gtacaaatca taatacttgt 420 

tgaaacataa tactttctct tctccaattt gtttagttta attttgaaa 469 

<210> 11 
<211> 3001 
<212> DNA 
<213> Plant 

<400> 11 

gtcgacctct gattccacaa gtcatgcacc cattcaatta tttaatggaa accaatttta 60 

ccctgtacaa atggtacaaa tactttcctt ggataaaaac aattttgcct aaggagtaaa 120 

cagatgcgaa gtaagaaagc agacgactaa agaaaatttt aaaaaaggag agagaaatga 180 

gcacacacac gtactaataa aattagggta ctactttact aataattgga cagagactaa 240 

attcatattt tagttccaaa atgtctcggg cagtccaacc atgcacgttg taatgagttt 300 
ttaactctat tatctcgagt tgcgccctcc
catatatgtc tattgggagt gtacagcgag 
tgaaacataa tactttctct tctccaattt 
atctaccaac acaaatggct cgaccatctt 
ccagagtggc acttccaaac acctcaacgg 
cggccaccat aatggcactt ccgaacatcg 
ccaaaacggc caccagaatg ggacttccga 
cgaacaccaa aacggccacc agaatgggac 
gacttccgaa caacagaacg ggacaatcag 
ctccaactct attaagcttg gttggttttc 
ctgagaaaga aactcaaatt catatttaaa 
gattttcttt tagtgattaa ttaaaaaagg 
ccagtaactt cgcatattat tctacacatt 
agcattctcc cttaaggttg agaagttact 
catgctcttt gaggtaaata atattctaat 
aatttacttt tagtttattg cacgtgtacg 
ctttggatgg agcaattcaa cacacagaga 
ttcatcttcc acttggttcc atcccaaacc 
ttggttttac attattcgaa atgcttcgtt 
aaatcgatga cgtggtagtt gatgtaagtc 
ttagattgct tctttttatt tttctaaaag 
aaattctctt tcttacaggt atctagaaaa 
gatcctcgtg taaccctcgt tctcggagat 
atcttttatt tttatttcat ttaatttacc 
ggttctcttt catttcaggg gctgcatttg 
ctattatagt ggactcttct gatcccattg 
atcttattga ataagctact aataaactgt 
tccagcaaaa gatttgtttg agaggccatt 
aggaggagtt gtatgcacac aggccgaaag 
aatcattgct aactgtcgtc aagtctttaa 
tccaacatat cccacgtatt ttctctctct 
gactactttc cttcctttga ttcctcggtt 
aatagataat gacgtccaag aaaaactaac 
aaactttatt tattatttcg ttctaaagag 
tgttttaaac ttttattttt acttattata 
tattatggcc acactaaaca tccaagtttt 
taatttgttt ttggtatgcg ggtgcggagt 
gagcaaattt ttatttcgag taaactttca 
actaatcaat tatgtctcaa ccattttgat 
tctactgaag ggccagaagt tgacttcaag 
actcaagtca agtccaaatt agcacctctc 
ctaacaattt ctttttctgt tttactgtat 
tctcaaatta tctttttttt tagattcaca 
gaagtatgat cgagtcttaa tcaagtgaat 
aagatcgagt cttaatcaag tgaataaata 
c 

<210> 12 



PCTAJS00/124S0 

actcctctgt gtccaagttg tatataaatg 360 

ctttcataaa gtacaaatca taatacttgt 420 

gtttagttta attttgaaaa tggaagtcat 480 

caagaatggt gccattccca tgaatggcca 540 

ctaccagaac ggcacttcca aacaccaaaa 600 

gaacggccac cagaatggga tttccgaaca 660 

acatcggaac ggccaccaga atgggatttc 720 

ttccgaacac caaaacggcc accagaatgg 780 

ccatgacaat ggcaacgagc tactgggaaa 840 

agagtttagc gcattatggc caggttagta 900 

gttaaaattg ttaggctaat ataagaagtt 960 

aaagagtatc aaataaattc caaaaaatga 1020 

aatttgaaat aaatcgaatt ttgcaggtga 1080 

atttcagggg aagtctgact accaagatgt 1140 

acacatgctt taatatgaat aaatactttt 1200 

tacagtcagc aacatatggg aaggttttga 1260 

atggtggatt tccatacact gaaatgattg 1320 

caaaaaaggt tttgatcatc ggcggaggaa 1380 

atcctacaat cgaaaaaatt gacattgttg 1440 

aaatttcttt tactcacata aaaaaatgat 1500 

aataaatata ttctctctta gttttaaaca 1560 

tctttccctt atctcgcagc taattttaat 1620 

ggtgcgtatt tataatctcg tttttgtttt 1680 

tttttgtgtg tggttaattt acccgtcatt 1740 

taaaggctgc acaagcagga tattatgatg 1800 

gtactctatt actacttaat accaagacta 1860 

aattgatttc taaaataata taatttcagg 1920 

ctttgaggca gtagccaaag ccctaaggcc 1980 

catttggctt catatgcata ttattaagca 2040 

gggctctgtc aactacgctt ggactactgt 2100 

ctctcttcat ctttgaaaat tgaaaatcct 2160 

aaaggggcgt agatcataag attttcaaga 2220 

ttcttttcat ttactattct ttttggtgac 2280 

aaaatttatt tttatatttt aaaataattt 2340 

tctttaataa aaaaattata gtcaaataaa 2400 

tgaaaccata agttttagag ccaaatgagt 2460 

caaattatgt cacaaaaatt gtaatggagt 2520 

aggtattgtg ttaaagtttt ttcaactgat 2580 

ttcagtggtg taattgggta tatgctctgc 2640 

aatccaataa atccaattga caaagagaca 2700 

aagttttaca attctgatgt aacttcatat 2760 

cttcattgtc aaaattttat aattaactct 2820 

aagcagcatt cattttgcca tctttcgcca 2880 

aatgaacact ggtggtgcaa tcattggacc 2940 

agtgaaatgc cgacgtattg tatgagaatt 3000 

3001 
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<211> 419 
<212> PRT 
<213> Plant 

<400> 12 

Met Glu Val Ile Ser Thr Asn Thr Asn Gly Ser Thr Ile Phe Lys Asn
1 5 10 15

Gly Ala Ile Pro Met Asn Gly His Gln Ser Gly Thr Ser Lys His Leu
20 25 30

Asn Gly Tyr Gln Asn Gly Thr Ser Lys His Gln Asn Gly His His Asn
35 40 45

Gly Thr Ser Glu His Arg Asn Gly His Gln Asn Gly Ile Ser Glu His
50 55 60

Gln Asn Gly His Gln Asn Gly Thr Ser Glu His Arg Asn Gly His Gln
65 70 75 80

Asn Gly Ile Ser Glu His Gln Asn Gly His Gln Asn Gly Thr Ser Glu
85 90 95

His Gln Asn Gly His Gln Asn Gly Thr Ser Glu Gln Gln Asn Gly Thr
100 105 110

Ile Ser His Asp Asn Gly Asn Glu Leu Leu Gly Asn Ser Asn Ser Ile
115 120 125

Lys Leu Gly Trp Phe Ser Glu Phe Ser Ala Leu Trp Pro Gly Glu Ala
130 135 140

Phe Ser Leu Lys Val Glu Lys Leu Leu Phe Gln Gly Lys Ser Asp Tyr
145 150 155 160

Gln Asp Val Met Leu Phe Glu Ser Ala Thr Tyr Gly Lys Val Leu Thr
165 170 175

Leu Asp Gly Ala Ile Gln His Thr Glu Asn Gly Gly Phe Pro Tyr Thr
180 185 190

Glu Met Ile Val His Leu Pro Leu Gly Ser Ile Pro Asn Pro Lys Lys
195 200 205

Val Leu Ile Ile Gly Gly Gly Ile Gly Phe Thr Leu Phe Glu Met Leu
210 215 220

Arg Tyr Pro Thr Ile Glu Lys Ile Asp Ile Val Glu Ile Asp Asp Val
225 230 235 240

Val Val Asp Val Ser Arg Lys Ser Phe Pro Tyr Leu Ala Ala Asn Phe
245 250 255

Asn Asp Pro Arg Val Thr Leu Val Leu Gly Asp Gly Ala Ala Phe Val
260 265 270

Lys Ala Ala Gln Ala Gly Tyr Tyr Asp Ala Ile Ile Val Asp Ser Ser
275 280 285

Asp Pro Ile Gly Pro Ala Lys Asp Leu Phe Glu Arg Pro Phe Phe Glu
290 295 300

Ala Val Ala Lys Ala Leu Arg Pro Gly Gly Val Val Cys Thr Gln Ala
305 310 315 320

Glu Ser Ile Trp Leu His Met His Ile Ile Lys Gln Ile Ile Ala Asn
325 330 335

Cys Arg Gln Val Phe Lys Gly Ser Val Asn Tyr Ala Trp Thr Thr Val
340 345 350

Pro Thr Tyr Pro Thr Gly Val Ile Gly Tyr Met Leu Cys Ser Thr Glu
355 360 365

Gly Pro Glu Val Asp Phe Lys Asn Pro Ile Asn Pro Ile Asp Lys Glu
370 375 380

Thr Thr Gln Val Lys Ser Lys Leu Ala Pro Leu Lys Phe Tyr Asn Ser
385 390 395 400

Asp Ile His Lys Ala Ala Phe Ile Leu Pro Ser Phe Ala Arg Ser Met
405 410 415

Ile Glu Ser

<210> 13
<211> 1636
<212> DNA
<213> Plant 

<400> 13 

ggcacgagat cagatccaat tctcttctgt gcttcccttc tctgctctca aattcttcag 60 
atctacaaag ttttcttcat tttcagaggg cagacatgga aactttcttg ttcacctcag 120 
agtcagtcaa tgaaggccac cccgacaagc tctgcgacca ggtctcggat gcaattcttg 180
atgcttgctt agaacaggat ccagaaagca aggttgcatg tgaaacctgc acaaagacaa 240

acatggttat ggtctttgga gagatcacaa ccaaggccac tgttgactat gagaagatag 300 

tgcgtgacac atgcagaggc attgggttca cctcagcaga tgttggcctt gacgctgaca 360 

actgcaaggt tcttgtcaac atcgagcagc agagccctga cattgcccaa ggtgttcacg 420 

gtcatcttac caagaaacca gaagagattg gagctggtga ccaaggtcac atgtttggct 480 

atgccactga tgaaacccca gagctcatgc cccttaccca tgtttgggcc actaagcttg 540 

gtgccaagct taccgaagtg aggaagaaca agacttgccc atggctcaga ccagatggca 600 

agacccaagt tactgttgag tacaagaacg acaatggtgc catggtccca attagagttc 660 

acactgttct catctcaact caacatgacg aaactgtcac aaacgaccag attgcccagg 720 

acttgaaaga gcatgtgatc aaacctgtga tcccatctca gtaccttgat gagaatacca 780

tcttccacct caacccatca ggtcgcttcg tcatcggtgg accacacgga gatgctggac 840 

ttaccggcag gaaaattatc attgacacct acggaggctg gggtgcccat ggaggaggtg 900 

ctttctcagg aaaggaccct actaaggtgg acaggagtgg tgcttatatt gttagacagg 960 

cagcaaagag tgtggtcgcc tcaggacttg ctcgccgctg tattgtgcag gtttcttatg 1020 

ctatcggtgt ggctgaacca ctttccgtgt ttgttgacac ttacaagact ggaacaattc 1080 

cagacaagga tattttgact ctgatcaagg agaactttga cttcaggcct ggaatgatgt 1140 

caatcaacct tgacttgtta agaggaggca acttcaggta ccagaagact gcagcttatg 1200 

gtcactttgg ccgtgatgac cccgacttct catgggagac tgtcaaggtc ctcaagccaa 1260

aagcttaagt gaggtgtagc cttttggcca ttatttttct tgcagaccaa taaacaagct 1320 

tcatcatatc atgcattggt ggcaggagaa gagaatttgt gtctccattg gaggattcta 1380 

tgagctctga gtcattgaac attgttattt ttctttcttt ttttttcacc cttttctgca 1440 

gtaccttatt tttattttgt tactgttaag tagcagtgat ttaagttttc cctgttaagt 1500 

agcagtgatt taagttttcc ctgttaagta gctggaatta agtttccatg ttctatcata 1560 

ttatatgtga acttgtcaat tatctcctga ggtgaaagag tccttcaggg aatagtttaa 1620 
aaaaaaaaaa aaaaaa 1636 

<210> 14 

<211> 390 

<212> PRT 

<213> Plant 

<400> 14 

Met Glu Thr Phe Leu Phe Thr Ser Glu Ser Val Asn Glu Gly His Pro
1 5 10 15

Asp Lys Leu Cys Asp Gln Val Ser Asp Ala Ile Leu Asp Ala Cys Leu
20 25 30

Glu Gln Asp Pro Glu Ser Lys Val Ala Cys Glu Thr Cys Thr Lys Thr
35 40 45

Asn Met Val Met Val Phe Gly Glu Ile Thr Thr Lys Ala Thr Val Asp
50 55 60

Tyr Glu Lys Ile Val Arg Asp Thr Cys Arg Gly Ile Gly Phe Thr Ser
65 70 75 80

Ala Asp Val Gly Leu Asp Ala Asp Asn Cys Lys Val Leu Val Asn Ile
85 90 95

Glu Gln Gln Ser Pro Asp Ile Ala Gln Gly Val His Gly His Leu Thr
100 105 110

Lys Lys Pro Glu Glu Ile Gly Ala Gly Asp Gln Gly His Met Phe Gly
115 120 125

Tyr Ala Thr Asp Glu Thr Pro Glu Leu Met Pro Leu Thr His Val Trp
130 135 140

Ala Thr Lys Leu Gly Ala Lys Leu Thr Glu Val Arg Lys Asn Lys Thr
145 150 155 160

Cys Pro Trp Leu Arg Pro Asp Gly Lys Thr Gln Val Thr Val Glu Tyr
165 170 175

Lys Asn Asp Asn Gly Ala Met Val Pro Ile Arg Val His Thr Val Leu
180 185 190

Ile Ser Thr Gln His Asp Glu Thr Val Thr Asn Asp Gln Ile Ala Gln
195 200 205

Asp Leu Lys Glu His Val Ile Lys Pro Val Ile Pro Ser Gln Tyr Leu
210 215 220

Asp Glu Asn Thr Ile Phe His Leu Asn Pro Ser Gly Arg Phe Val Ile
225 230 235 240

Gly Gly Pro His Gly Asp Ala Gly Leu Thr Gly Arg Lys Ile Ile Ile
245 250 255

Asp Thr Tyr Gly Gly Trp Gly Ala His Gly Gly Gly Ala Phe Ser Gly
260 265 270

Lys Asp Pro Thr Lys Val Asp Arg Ser Gly Ala Tyr Ile Val Arg Gln
275 280 285

Ala Ala Lys Ser Val Val Ala Ser Gly Leu Ala Arg Arg Cys Ile Val
290 295 300

Gln Val Ser Tyr Ala Ile Gly Val Ala Glu Pro Leu Ser Val Phe Val
305 310 315 320

Asp Thr Tyr Lys Thr Gly Thr Ile Pro Asp Lys Asp Ile Leu Thr Leu
325 330 335

Ile Lys Glu Asn Phe Asp Phe Arg Pro Gly Met Met Ser Ile Asn Leu
340 345 350

Asp Leu Leu Arg Gly Gly Asn Phe Arg Tyr Gln Lys Thr Ala Ala Tyr
355 360 365

Gly His Phe Gly Arg Asp Asp Pro Asp Phe Ser Trp Glu Thr Val Lys
370 375 380

Val Leu Lys Pro Lys Ala
385 390



<210> 15 
<211> 1596 
<212> DNA 
<213> Plant 



<400> 15 

ggcacgaggg gaacaagaga aacatcatat tattgaatcc ctagtttctt ttctttccct 60
ttgattcctt cctctcattt acctctctct tttcttcctt tgtttggatg gccggccaaa 120
caatcatcgt ttccgggttg aacccggcgg ccattcttca gtccacaatt ggcggcggag 180
cttctcctac agcggcggcg gcggcggaaa acggcaccag aaaagtcatc cctctctcaa 240
gagatgcctt acaagatttc atgttatcaa tcataaccca aaaattacaa gatgagaaac 300
aaccttttta cgtgctagac ttgggtgagg ttgtttctct tatggaccaa tggaaatctg 360
ctctcccaaa tatccgtcca ttttacgctg ttaaatgtaa ccctgaaccg tcgttccttt 420
caattttatc tgctatgggc tcaaattttg attgtgctag ccgagctgaa attgagtatg 480
ttttatctct tggcatttca cctgaccgta ttgttttcgc aaatccatgc aaaccggaat 540
ccgatattat ttttgcagca aaagttgggg tgaatcttac aacctatgat tctgaagacg 600
aggtttacaa gatccgaaag catcacccga aatccgaact cttgctccgc atcaagccca 660
tgctcgacgg caacgcgaga tgcccaatgg gcccgaaata cggcgcgctt ccagaagaag 720
tcgacccgct gctccgggca gctcaagccg cccgtctcac cgtatccggc gtctcattcc 780
acatcggtag cggagatgcc gattcaaacg cttatctcgg cgccatagcc gcggctaagg 840
aagtgtttga aacagctgct aaactcggga tgtcgaaaat gactgttcta gacgtcggcg 900
gcgggtttac atccggccac cagttcacaa ccgccgccgt cgccgttaaa tcagctttaa 960
aacaacactt cgatgacgaa ccggagttga caatcatagc tgaaccgggt cggttttttg 1020
cagagacggc gtttactttg gcaacgacga ttatagggaa aagagtgagg ggtgaattga 1080
gggagtattg gattaacgac gggctgtacg gttcgatgaa ctgtgtactt tacgaccatg 1140
cgacggtgaa tgcaacgccg ttagctgttc tgtcgaatcg tagtaacgtt acctgcggcg 1200
ggtcgaaaac gtttccgacg actgtgtttg ggcccacttg tgatgctctt gatactgttt 1260
taagggatta ccagttaccg gagctgcagg ttaatgattg gctggttttt cctaatatgg 1320
gtgcttatac taaagctgct gggtccaatt ttaatggatt taatacttcc gccattgtta 1380
ctcacctcgc ttattcttat ccaagctgat gaaccacctg tattaggaat tactaccgtg 1440
gttttgatgg ttttttcctt ttttgggtat ctttttttta attttgttgt ttttggtagt 1500
aatttatatt ccaaatcagc ttgtaattct cttgtatgcc ataagaatgc aaggatttgc 1560
taattgtgat tttctctaaa aaaaaaaaaa aaaaaa 1596

<210> 16
<211> 433
<212> PRT
<213> Plant 

<400> 16 

Met Ala Gly Gln Thr Ile Ile Val Ser Gly Leu Asn Pro Ala Ala Ile
1 5 10 15

Leu Gln Ser Thr Ile Gly Gly Gly Ala Ser Pro Thr Ala Ala Ala Ala
20 25 30

Ala Glu Asn Gly Thr Arg Lys Val Ile Pro Leu Ser Arg Asp Ala Leu
35 40 45

Gln Asp Phe Met Leu Ser Ile Ile Thr Gln Lys Leu Gln Asp Glu Lys
50 55 60

Gln Pro Phe Tyr Val Leu Asp Leu Gly Glu Val Val Ser Leu Met Asp
65 70 75 80

Gln Trp Lys Ser Ala Leu Pro Asn Ile Arg Pro Phe Tyr Ala Val Lys
85 90 95

Cys Asn Pro Glu Pro Ser Phe Leu Ser Ile Leu Ser Ala Met Gly Ser
100 105 110

Asn Phe Asp Cys Ala Ser Arg Ala Glu Ile Glu Tyr Val Leu Ser Leu
115 120 125

Gly Ile Ser Pro Asp Arg Ile Val Phe Ala Asn Pro Cys Lys Pro Glu
130 135 140

Ser Asp Ile Ile Phe Ala Ala Lys Val Gly Val Asn Leu Thr Thr Tyr
145 150 155 160

Asp Ser Glu Asp Glu Val Tyr Lys Ile Arg Lys His His Pro Lys Ser
165 170 175

Glu Leu Leu Leu Arg Ile Lys Pro Met Leu Asp Gly Asn Ala Arg Cys
180 185 190

Pro Met Gly Pro Lys Tyr Gly Ala Leu Pro Glu Glu Val Asp Pro Leu
195 200 205

Leu Arg Ala Ala Gln Ala Ala Arg Leu Thr Val Ser Gly Val Ser Phe
210 215 220

His Ile Gly Ser Gly Asp Ala Asp Ser Asn Ala Tyr Leu Gly Ala Ile
225 230 235 240

Ala Ala Ala Lys Glu Val Phe Glu Thr Ala Ala Lys Leu Gly Met Ser
245 250 255

Lys Met Thr Val Leu Asp Val Gly Gly Gly Phe Thr Ser Gly His Gln
260 265 270

Phe Thr Thr Ala Ala Val Ala Val Lys Ser Ala Leu Lys Gln His Phe
275 280 285

Asp Asp Glu Pro Glu Leu Thr Ile Ile Ala Glu Pro Gly Arg Phe Phe
290 295 300

Ala Glu Thr Ala Phe Thr Leu Ala Thr Thr Ile Ile Gly Lys Arg Val
305 310 315 320

Arg Gly Glu Leu Arg Glu Tyr Trp Ile Asn Asp Gly Leu Tyr Gly Ser
325 330 335

Met Asn Cys Val Leu Tyr Asp His Ala Thr Val Asn Ala Thr Pro Leu
340 345 350

Ala Val Leu Ser Asn Arg Ser Asn Val Thr Cys Gly Gly Ser Lys Thr
355 360 365

Phe Pro Thr Thr Val Phe Gly Pro Thr Cys Asp Ala Leu Asp Thr Val
370 375 380

Leu Arg Asp Tyr Gln Leu Pro Glu Leu Gln Val Asn Asp Trp Leu Val
385 390 395 400

Phe Pro Asn Met Gly Ala Tyr Thr Lys Ala Ala Gly Ser Asn Phe Asn
405 410 415

Gly Phe Asn Thr Ser Ala Ile Val Thr His Leu Ala Tyr Ser Tyr Pro
420 425 430

Ser



<210> 17 
<211> 2074 
<212> DNA 
<213> Plant 

<400> 17 

tggtaactgg accgacgcga catttgtcgt atatgtctta atcgggctag tcgctgacaa 60
catcatccac caagtcaaag ttcggaaatt catatcgttt ctcatcatct tctatccgag 120
aaatgagggg actatctgta tacggtcaaa accgagtctg cccttcatat gactaatcga 180 
gattagaaca taatggtcta aggttcatca ttataataac gagccatgat atagagttag 240 
gttgtcaagc tcaagcccca gagagcgatc aatatcgaga tcgagccaag gttaaactcg 300 
agacctagag accgatcaat accgagaccg accaagtcaa ggtcgagctc gagacccaga 360 
gaccggtcaa gattgagatc ggccaagatc gagatcgagc caagaaatta aaaagtcgtt 420 
atagccgcat ttagggagag aatctctgcg gaaatcacga cttgaatcag ggaaaaacta 480 
attaattaat ctatcatgtg atccccacta tgtattttta attatactca aaatgggatt 540 
cccccactat attaagagtg gttatcattt gtaatggaga gacatacaca cattcattct 600 
gacatataca gaaaatagag caaatactat ccttttttgg cttttgatat ttagtcatat 660 
tgtttcttct acccattgtt cttcactcaa tttggaggtg ataaaacttg aaggtttaag 720 
ctaactagtc cattcgggtt gcattcattt cttttacaat aatttcgtca tcatttattt 780
attttctcaa ttgtactaag ttataccacg tatttttaga actgcgtata aattcaactc 840 
tatccatttt tcgggtaaac accgaataca tagcacaata gcaccctcaa ttgcaaaagt 900 
ccaaagccaa gggttcattc ctttctgaag aaatgagata gagaattgaa aatctaattt 960 
agttatctaa atctttataa tttagccttc catataagaa aaaggaaaca aattaactga 1020 
agaacaatag cctcgcatag atttaccttc tccatataaa ttttgtttat actcaatttt 1080 
tttgcaaatg tgtctaaaat gataggactt gcaaattttt atttaacatt tcctactcct 1140 
ctttaatttt caagaaatta attttaagca ttctcgattt gctctgcccg ctccgtcccg 1200 
ttgccatctc tgactcggat aggacctcga ttgcaaaaat ccaaacccaa ggaaccttcc 1260 
atacattaca taagccacaa aatagtaact attaaaaact accaatatat cctcaaatac 1320 
tcgcgattat ttcataccta acacgtttac cttatcttct cgtaatgacg ctacattagt 1380 
tagtgatata aaataccgaa tttaccacgc ggcaaccctc cgctgtctat ccacggcccg 1440 
agagaatctc ttagcccccc aaatacgaaa attaacttct agaattttat tttctggtta 1500 
ttaccatgaa aataaagaaa aagagaaaag tcaagaaatt taattgggct aatactgggg 1560 
tccactgccc agccacgcat ttccctccta tataaagcgt cgtcacctct catgcaaatc 1620 
tcgctcactt cacagttgtt agtttcacgt tctcttctca attcccataa aagaaaccct 1680 
tccgttaggt ttccgtccta ttttctcttc ttctacgctt cctcttctga tatcaatatc 1740 
tgtatggtgt ttttcttgtt cgaattttag atttgttttg cctttaatac ctgtaacctt 1800 
ataattctct gtttaaacca aaaacttagc ttcttctgaa gtcagggtgg ggatatttgg 1860 
atcgtgtaag agtgtgttag aaggtgatta tcttttgatt cagttccttt tttgcttctt 1920 
ttgagggggt agccggggcc tcggcctcgg cgggttttaa tagcccccat ctattacaac 1980 
cattgggcaa aaacatcatt aaatctgtac aaagcaaacc cttaatttag tttaattttc 2040 
tgtattcttt gattctttaa cagaagaaga agag 2074

<210> 18
<211> 4321
<212> DNA
<213> Plant 

<400> 18 

tggtaactgg accgacgcga catttgtcgt atatgtctta atcgggctag tcgctgacaa 60 

catcatccac caagtcaaag ttcggaaatt catatcgttt ctcatcatct tctatccgag 120 

aaatgagggg actatctgta tacggtcaaa accgagtctg cccttcatat gactaatcga 180 

gattagaaca taatggtcta aggttcatca ttataataac gagccatgat atagagttag 240 

gttgtcaagc tcaagcccca gagagcgatc aatatcgaga tcgagccaag gttaaactcg 300 

agacctagag accgatcaat accgagaccg accaagtcaa ggtcgagctc gagacccaga 360 

gaccggtcaa gattgagatc ggccaagatc gagatcgagc caagaaatta aaaagtcgtt 420

atagccgcat ttagggagag aatctctgcg gaaatcacga cttgaatcag ggaaaaacta 480

attaattaat ctatcatgtg atccccacta tgtattttta attatactca aaatgggatt 540

cccccactat attaagagtg gttatcattt gtaatggaga gacatacaca cattcattct 600

gacatataca gaaaatagag caaatactat ccttttttgg cttttgatat ttagtcatat 660

tgtttcttct acccattgtt cttcactcaa tttggaggtg ataaaacttg aaggtttaag 720

ctaactagtc cattcgggtt gcattcattt cttttacaat aatttcgtca tcatttattt 780

attttctcaa ttgtactaag ttataccacg tatttttaga actgcgtata aattcaactc 840

tatccatttt tcgggtaaac accgaataca tagcacaata gcaccctcaa ttgcaaaagt 900

ccaaagccaa gggttcattc ctttctgaag aaatgagata gagaattgaa aatctaattt 960

agttatctaa atctttataa tttagccttc catataagaa aaaggaaaca aattaactga 1020

agaacaatag cctcgcatag atttaccttc tccatataaa ttttgtttat actcaatttt 1080

tttgcaaatg tgtctaaaat gataggactt gcaaattttt atttaacatt tcctactcct 1140

ctttaatttt caagaaatta attttaagca ttctcgattt gctctgcccg ctccgtcccg 1200

ttgccatctc tgactcggat aggacctcga ttgcaaaaat ccaaacccaa ggaaccttcc 1260

atacattaca taagccacaa aatagtaact attaaaaact accaatatat cctcaaatac 1320

tcgcgattat ttcataccta acacgtttac cttatcttct cgtaatgacg ctacattagt 1380

tagtgatata aaataccgaa tttaccacgc ggcaaccctc cgctgtctat ccacggcccg 1440

agagaatctc ttagcccccc aaatacgaaa attaacttct agaattttat tttctggtta 1500

ttaccatgaa aataaagaaa aagagaaaag tcaagaaatt taattgggct aatactgggg 1560

tccactgccc agccacgcat ttccctccta tataaagcgt cgtcacctct catgcaaatc 1620

tcgctcactt cacagttgtt agtttcacgt tctcttctca attcccataa aagaaaccct 1680

tccgttaggt ttccgtccta ttttctcttc ttctacgctt cctcttctga tatcaatatc 1740

tgtatggtgt ttttcttgtt cgaattttag atttgttttg cctttaatac ctgtaacctt 1800

ataattctct gtttaaacca aaaacttagc ttcttctgaa gtcagggtgg ggatatttgg 1860

atcgtgtaag agtgtgttag aaggtgatta tcttttgatt cagttccttt tttgcttctt 1920

ttgagggggt agccggggcc tcggcctcgg cgggttttaa tagcccccat ctattacaac 1980

cattgggcaa aaacatcatt aaatctgtac aaagcaaacc cttaatttag tttaattttc 2040

tgtattcttt gattctttaa cagaagaaga agagatgccg gccctaggtt gttgcgtaga 2100

cgctactgtt tcccctcctc tcggctatgc cttctctcgg gatagctctc ttcccgcgcc 2160

ggagttcttt acctccggcg tacctcctac aaactccgcc gccggttcca ttgggtctcc 2220

ggatctgtcc tctgctttgt acggggtcga tgggtgggga gctccttatt tctccgttaa 2280

ctctaacgga gatatctccg tccgaccaca tggtacggac acactccccc accaggaaat 2340

tgaccttctc aaggtcgtga aaaaggcctc cgacccgaaa aattcagggg ggctcgggct 2400

tcagctgcct cttgttgttc gcttccctga tgtgctaaaa aaccggttgg aatctctgca 2460

atcggctttt gatctcgctg ttcattccca gggctatggg gcccactacc aaggtgttta 2520

tcccgtgaaa tgcaatcaag acaggttcgt ggtggaagat attgtcaaat tcgggtcgtc 2580

attccggttc gggttggaag ctgggtctaa acccgagctc ctgttagcca tgagctgtct 2640

ctgcaggggc agtgctgagg gccttctcgt ttgcaatggt ttcaaggacg ctgagtacat 2700

ttcgcttgct ttggttgcaa gaaagctcat gttaaacact gtaattgttc ttgaacaaga 2760

ggaggagctt gaccttgtga ttgatataag ccgtaagatg gctgttcggc ccgtaattgg 2820

acttcgggct aagctcagga ccaagcattc aggccatttt ggatccactt ctggagaaaa 2880

aggtaagttt gggcttacaa cgacccaaat tgttcgtgta gtgaagaagc tggaagaatc 2940

cggaatgctg gattgccttc agttgctgca ttttcacatt ggatctcaga tcccttcaac 3000

ggcgttgctt gctgatggtg ttggtgaggc tgctcagatt tattgtgaat taatccgtct 3060

tggtgcgggt atgaagttca ttgatactgg aggtgggctc ggaattgatt atgatggtac 3120

taaatcatgt gattcagatg tctctgttgg ctatggcatt caagaatacg cctccacagt 3180

tgtccaggcg gttcaatatg tttgcgaccg taagggcgtg aagcacccag tgatttgcag 3240

cgaaagtggc agggcaattg tttctcatca ctcaattctg attttcgaag ccgtgtctgc 3300

ttctagtcac tcatgttctt cttcacatct gtcttctggt ggcctccaat ccatggcgga 3360

gacgctcaat gaagatgccc ttgctgatta ccgcaattta tctgctgctg cagttcgtgg 3420

agagtacgag acgtgtgtac tttactctga tcagttgaaa cagagatgtg tggatcagtt 3480

taaagaaggg tccttgggta ttgaacatct tgctgctgtt gatagcatct gtgattttgt 3540

atcaaaggct atgggggctg ctgatcctat ccgcacttac catgtgaatc tgtcaatttt 3600

cacttcaatt cctgattttt gggcctttgg tcaattgttt ccgattgttc caatacaccg 3660

tttagatgaa aagcctgcag taaggggaat attatcggac ttgacttgtg acagtgatgg 3720

gaaggttgat aagttcattg gtggcgaatc aagcttgcag ctgcatgaat tgggaagtaa 3780

tggcgatggt ggtgggtatt atctggggat gtttttgggt ggggcttatg aggaggcgct 3840

cggaggactc cacaacctgt ttggtggacc aagcgtggtg cgcgtggtgc agagcgatag 3900

cgctcacagc ttcgccatgt ctcgctccgt ccctggcccg tcctgcgcgg acgtgctccg 3960

agcgatgcag cacgagcccg agctcatgtt cgagactctc aagcaccgtg cggaggaatt 4020

cttggaacaa gaagaagaca aagggctggc cattgcatct ttggccagca gcttagctca 4080

gtccttccat aacatgcctt accttgtggc gcctgcatct tgctgcttca ctgcagttac 4140

tgctaacaac ggtggctata actactatta cagtgatgag aatgcagcag attctgctac 4200

aggggaggat gagatttggt cctattgcac tgcttgaagt gttgtcgtag catctccagt 4260

tttagtttgt cgtcgaagtt gtctgttttt gaataatacc cttagttggt gatgtttttc 4320

t 4321

<210> 19
<211> 720
<212> PRT
<213> Plant

<400> 19

Met Pro Ala Leu Gly Cys Cys Val Asp Ala Thr Val Ser Pro Pro Leu
1 5 10 15

Gly Tyr Ala Phe Ser Arg Asp Ser Ser Leu Pro Ala Pro Glu Phe Phe
20 25 30

Thr Ser Gly Val Pro Pro Thr Asn Ser Ala Ala Gly Ser Ile Gly Ser
35 40 45

Pro Asp Leu Ser Ser Ala Leu Tyr Gly Val Asp Gly Trp Gly Ala Pro
50 55 60

Tyr Phe Ser Val Asn Ser Asn Gly Asp Ile Ser Val Arg Pro His Gly
65 70 75 80

Thr Asp Thr Leu Pro His Gln Glu Ile Asp Leu Leu Lys Val Val Lys
85 90 95

Lys Ala Ser Asp Pro Lys Asn Ser Gly Gly Leu Gly Leu Gln Leu Pro
100 105 110

Leu Val Val Arg Phe Pro Asp Val Leu Lys Asn Arg Leu Glu Ser Leu
115 120 125

Gln Ser Ala Phe Asp Leu Ala Val His Ser Gln Gly Tyr Gly Ala His
130 135 140

Tyr Gln Gly Val Tyr Pro Val Lys Cys Asn Gln Asp Arg Phe Val Val
145 150 155 160

Glu Asp Ile Val Lys Phe Gly Ser Ser Phe Arg Phe Gly Leu Glu Ala
165 170 175

Gly Ser Lys Pro Glu Leu Leu Leu Ala Met Ser Cys Leu Cys Arg Gly
180 185 190

Ser Ala Glu Gly Leu Leu Val Cys Asn Gly Phe Lys Asp Ala Glu Tyr
195 200 205

Ile Ser Leu Ala Leu Val Ala Arg Lys Leu Met Leu Asn Thr Val Ile
210 215 220

Val Leu Glu Gln Glu Glu Glu Leu Asp Leu Val Ile Asp Ile Ser Arg
225 230 235 240

Lys Met Ala Val Arg Pro Val Ile Gly Leu Arg Ala Lys Leu Arg Thr
245 250 255

Lys His Ser Gly His Phe Gly Ser Thr Ser Gly Glu Lys Gly Lys Phe
260 265 270

Gly Leu Thr Thr Thr Gln Ile Val Arg Val Val Lys Lys Leu Glu Glu
275 280 285

Ser Gly Met Leu Asp Cys Leu Gln Leu Leu His Phe His Ile Gly Ser
290 295 300

Gln Ile Pro Ser Thr Ala Leu Leu Ala Asp Gly Val Gly Glu Ala Ala
305 310 315 320

Gln Ile Tyr Cys Glu Leu Ile Arg Leu Gly Ala Gly Met Lys Phe Ile
325 330 335

Asp Thr Gly Gly Gly Leu Gly Ile Asp Tyr Asp Gly Thr Lys Ser Cys
340 345 350

Asp Ser Asp Val Ser Val Gly Tyr Gly Ile Gln Glu Tyr Ala Ser Thr
355 360 365

Val Val Gln Ala Val Gln Tyr Val Cys Asp Arg Lys Gly Val Lys His
370 375 380

Pro Val Ile Cys Ser Glu Ser Gly Arg Ala Ile Val Ser His His Ser
385 390 395 400

Ile Leu Ile Phe Glu Ala Val Ser Ala Ser Ser His Ser Cys Ser Ser
405 410 415

Ser His Leu Ser Ser Gly Gly Leu Gln Ser Met Ala Glu Thr Leu Asn
420 425 430

Glu Asp Ala Leu Ala Asp Tyr Arg Asn Leu Ser Ala Ala Ala Val Arg
435 440 445

Gly Glu Tyr Glu Thr Cys Val Leu Tyr Ser Asp Gln Leu Lys Gln Arg
450 455 460

Cys Val Asp Gln Phe Lys Glu Gly Ser Leu Gly Ile Glu His Leu Ala
465 470 475 480

Ala Val Asp Ser Ile Cys Asp Phe Val Ser Lys Ala Met Gly Ala Ala
485 490 495

Asp Pro Ile Arg Thr Tyr His Val Asn Leu Ser Ile Phe Thr Ser Ile
500 505 510

Pro Asp Phe Trp Ala Phe Gly Gln Leu Phe Pro Ile Val Pro Ile His
515 520 525

Arg Leu Asp Glu Lys Pro Ala Val Arg Gly Ile Leu Ser Asp Leu Thr
530 535 540

Cys Asp Ser Asp Gly Lys Val Asp Lys Phe Ile Gly Gly Glu Ser Ser
545 550 555 560

Leu Gln Leu His Glu Leu Gly Ser Asn Gly Asp Gly Gly Gly Tyr Tyr
565 570 575

Leu Gly Met Phe Leu Gly Gly Ala Tyr Glu Glu Ala Leu Gly Gly Leu
580 585 590

His Asn Leu Phe Gly Gly Pro Ser Val Val Arg Val Val Gln Ser Asp
595 600 605

Ser Ala His Ser Phe Ala Met Ser Arg Ser Val Pro Gly Pro Ser Cys
610 615 620

Ala Asp Val Leu Arg Ala Met Gln His Glu Pro Glu Leu Met Phe Glu
625 630 635 640

Thr Leu Lys His Arg Ala Glu Glu Phe Leu Glu Gln Glu Glu Asp Lys
645 650 655

Gly Leu Ala Ile Ala Ser Leu Ala Ser Ser Leu Ala Gln Ser Phe His
660 665 670

Asn Met Pro Tyr Leu Val Ala Pro Ala Ser Cys Cys Phe Thr Ala Val
675 680 685

Thr Ala Asn Asn Gly Gly Tyr Asn Tyr Tyr Tyr Ser Asp Glu Asn Ala
690 695 700

Ala Asp Ser Ala Thr Gly Glu Asp Glu Ile Trp Ser Tyr Cys Thr Ala
705 710 715 720



<210> 20 
<211> 2118 
<212> DNA 
<213> Plant 

<400> 20 

gaattcctta tccggatttc tggtacgcag actgtaatat ggagtcatct tctcctcgat 60 
tcgggattaa aattaggtga cttgggacac cctaaatctc ccaagtggcg actctgaaat 120 
aaataaacaa atcccgtttc gattgtcctt aaattggaaa aaactccctt gtaccctccc 180 
gggtacggaa aaaggaggtg tacagcaatg acccaaaact tttattgcta tacattttga 240 
ggaatcaact tgatcaaaat ttatgggtga aattcaatgt ggtatgattt atattaggtc 300 
ggactttagc agatgtggtc acttcaattt gcggcaaaaa taatgtacag ggataataat 360 
aaaaagtact agaaatttga gtcataaagc tttttcaatt ttacaaaaga tattaagata 420 
cttattaaat caaatgtact ttattaatgt aatagcatga aaaaacagcc tcatccgcct 480 
gtcctcaccc cacaaaaagg agatagagaa aggaaactaa tcttatttaa ttttccacat 540 
ataaaattta tatccttgta taaatcccca aaaaaaaaaa atcaatacta attattttaa 600 
tttaatcatc cgtataagaa agaagctaat taactgactt acaaactgaa tagatagcac 660 
aatagcactc tcaattacaa aaatccaaag ccgagggtca ttcctttcat caagaaatta 720 
gatagggaat ggaaaatata atttaattat ctgaatcttt ataatttatc cttccatata 780 
agaaaaagga aacaaattaa ctgaagagca tatagcctcg catagattta ccttctccat 840 
atgggtgggg aaaccgacaa accgcaccaa ttcgataatt cgagtcaaac tgaggaaaaa 900 
aaaattcgac tatggtttgg tttgatttgg tttattgttg ggataaaaaa tcgatcataa 960 
ttggtttggt ttggttttaa ctaaagaaag tcaaaccgaa accaaacaaa cccgacatta 1020 
catatataaa ttttttagat atatttaata tataaatata cttgttgtga tgtaatttat 1080 
aaatatttct taaaaatatt cataatttta tcttttaaga tattatttcg tacttagaac 1140 
ttttgaatgt ttcttactcc tctttaattt gttagaaatt aatttgaagg agtttgaatt 1200 
tgctccgccc ccgctccgtc ccgttgccat ccctgactca ataggataac agcaatctcg 1260 
attgcaaaaa tccaaaccca aggaaccttc ccaacattac ataagctaca aagtagagta 1320 



25 



WQ0067558 [ file:/A\dcwas03\ftrmdata\lp\FolevPat\PatentDocuments\WO0067558.CPC l 



Page 90 of 103 



WO 00/67558 

gtttattaaa taactaccaa tatatcctca 
cttaccttat cttctcgtaa tgacgctaca 
cacgcggcaa tcctccgctg tctatccacg 
tgaaaattaa cttctagaat tttattttct 
aaaaagagaa aagtaaagat atttaattgg 
catttccctc ctatataaag cgtcgtcacc 
gttagtttca cgttctcttc tcaattccca 
ctatttttcc tcatcttctc cgtttcctct 
ttgttcgaat tttagatttg ttttgtcttt 
accaaaaact tagcttcttc tgaagtcagg 
ttagagggtg attatctttt gattcagttc 
ggcctcggcc tcggcgggtt ttaatagccc 
cattaaatct gtacaaaaca aacccttaat 
ttaacagaag aagaagag 

<210> 21 
<211> 4368 
<212> DNA 
<213> Plant 

<400> 21 

gaattcctta tccggatttc tggtacgcag 
tcgggattaa aattaggtga cttgggacac 
aaataaacaa atcccgtttc gattgtcctt 
gggtacggaa aaaggaggtg tacagcaatg 
ggaatcaact tgatcaaaat ttatgggtga 
ggactttagc agatgtggtc acttcaattt 
aaaaagtact agaaatttga gtcataaagc 
cttattaaat caaatgtact ttattaatgt 
gtcctcaccc cacaaaaagg agatagagaa 
ataaaattta tatccttgta taaatcccca 
tttaatcatc cgtataagaa agaagctaat 
aatagcactc tcaattacaa aaatccaaag 
gatagggaat ggaaaatata atttaattat 
agaaaaagga aacaaattaa ctgaagagca 
atgggtgggg aaaccgacaa accgcaccaa 
aaaattcgac tatggtttgg tttgatttgg 
ttggtttggt ttggttttaa ctaaagaaag 
catatataaa ttttttagat atatttaata 
aaatatttct taaaaatatt cataatttta 
ttttgaatgt ttcttactcc tctttaattt 
tgctccgccc ccgctccgtc ccgttgccat 
attgcaaaaa tccaaaccca aggaaccttc 
gtttattaaa taactaccaa tatatcctca 
cttaccttat cttctcgtaa tgacgctaca 
cacgcggcaa tcctccgctg tctatccacg 
tgaaaattaa cttctagaat tttattttct 
aaaaagagaa aagtaaagat atttaattgg 



PCT/USOO/12450 

aattctcgcg attatttcat acctaacacg 1380 
ttagttggtg atataaaata ccgaatttgc 1440 
gcccgagaga atctcttagc cccccaaaga 1500 
ggttattacc atgaaaataa ttaaataaaa 1560 
gctaaaactg gggtccacgg cccagccacg 1620 
tctcatgcaa atctcgctca ctacacagtt 1680 
taacagaaac ccttccgtta ggtttccgtc 17 40 
tctgaaatca atatctgtat ggtgtttttc 1800 
aatacctata accttaaatt ctctgtttaa 18 60 
gtggggattt ttggatcgtg taagagtgtg 1920 
cttttttgct tcttttgagg gggtagccgg 1980 
ccatctatta caactattgg gcaaaaacat 2040 
ttagtttaat tttctgtatt cattgatttt 2100 

2118 



actgtaatat ggagtcatct tctcctcgat 60 
cctaaatctc ccaagtggcg actctgaaat 120 
aaattggaaa aaactccctt gtaccctccc 180 
acccaaaact tttattgcta tacattttga 240 
aattcaatgt ggtatgattt atattaggtc 300 
gcggcaaaaa taatgtacag ggataataat 360 
tttttcaatt ttacaaaaga tattaagata 420 
aatagcatga aaaaacagcc tcatccgcct 480 
aggaaactaa tcttatttaa ttttccacat 540 
aaaaaaaaaa atcaatacta attattttaa 600 
taactgactt acaaactgaa tagatagcac 660 
ccgagggtca ttcctttcat caagaaatta 720 
ctgaatcttt ataatttatc cttccatata 780 
tatagcctcg catagattta ccttctccat 840 
ttcgataatt cgagtcaaac tgaggaaaaa 900 
tttattgttg ggataaaaaa tcgatcataa 960 
tcaaaccgaa accaaacaaa cccgacatta 1020 
tataaatata cttgttgtga tgtaatttat 1080 
tcttttaaga tattatttcg tacttagaac 1140 
gttagaaatt aatttgaagg agtttgaatt 1200 
ccctgactca ataggataac agcaatctcg 1260 
ccaacattac ataagctaca aagtagagta 132 0 
aattctcgcg attatttcat ficctaacacg 1380 
ttagttggtg atataaaata ccgaatttgc 1440 
gcccgagaga atctcttagc cccccaaaga 1500 
ggttattacc atgaaaataa ttaaataaaa 1560 
gctaaaactg gggtccacgg cccagccacg 1620 
catttccctc ctatataaag cgtcgtcacc tctcatgcaa atctcgctca ctacacagtt 1680 
gttagtttca cgttctcttc tcaattccca taacagaaac ccttccgtta ggtttccgtc 1740 
ctatttttcc tcatcttctc cgtttcctct tctgaaatca atatctgtat ggtgtttttc 1800 
ttgttcgaat tttagatttg ttttgtcttt aatacctata accttaaatt ctctgtttaa 1860 
accaaaaact tagcttcttc tgaagtcagg gtggggattt ttggatcgtg taagagtgtg 1920 
ttagagggtg attatctttt gattcagttc cttttttgct tcttttgagg gggtagccgg 1980 
ggcctcggcc tcggcgggtt ttaatagccc ccatctatta caactattgg gcaaaaacat 2040 
cattaaatct gtacaaaaca aacccttaat ttagtttaat tttctgtatt cattgatttt 2100 
ttaacagaag aagaagagat gccggcccta ggttgttgtg tagatgctgc tgttgtttcc 2160 
cctcctctca gctatgcctt ctctcgggat agctctcttc ccgcgccgga gttctttgcc 2220 
tccggcgtac ctccgacaaa ttctgccgct gcttccattg ggtctccgga tttgtcgtct 2280 
gctttatacg gggtcgatgg gtggggagct ccttatttct ctgttaactc taatggagat 2340 
atctccgtcc gaccacacgg tacggacact ctccctcacc aggaaattga ccttctcaag 2400 
gtcgtgaaaa aggcctccga cccgaaaaat tcaggtgggc ttgggcttca gctgcctctt 2460 
gttgttcgct tccctgatgt gttgaaaaac cggttggaat ctctgcaatc ggcttttgat 2520 
ctcgcggttc attcccaggg ctatggggcc cactaccaag gtgtttatcc cgtgaaatgc 2580 
aatcaagaca ggttcgtggt ggaagatatc gtgaaattcg ggtcgccatt ccggttcggg 2640 
ttggaagccg ggtctaaacc cgagctcctg ttagccatga gctgtctctg caagggcagt 2700 
gctgagggcc ttctcgtttg caatggtttc aaggacgctg agtacatttc gcttgctttg 2760 
gttgcaagaa agctcatgtt aaacactgta attgtgcttg aacaagagga ggagcttgac 2820 
cttgtgattg atataagcca taagatggct gttcggcctg taattggact tcgggctaag 2880 
ctcaggacca agcattcagg ccattttgga tccacttctg gagaaaaagg taagtttggg 2940 
cttacaacga cccaaattgt tcgtgtggtg aagaagctag aagaatccgg aatgctggat 3000 
tgtcttcagt tgctgcattt tcacattgga tctcagatcc cttctacggg gttgctagct 3060 
gatggagttg gtgaggccgc tcagatttat tgtgaattag tccgtcttgg agcgggtatg 3120 
aagttcattg atattggagg tgggcttgga attgattatg atggtactaa atcatgcgat 3180 
tctgatgtct ctgttggcta tggcattcaa gaatatgcct ccgcagttgt tcaggcggtt 3240 
caatatgtat gcgaccgtaa gggtgtgaaa cacccagtga tctgcagcga aagtggcagg 3300 
gcaattgttt ctcatcactc aattctgatt ttcgaagccg tgtctgcttc tagtcactca 3360 
tgttcttctt cacatctgtc ttctggtggc ctccaatcca tggcggagac gctcaacgaa 3420 
gatgcccttg ctgattaccg caatttatct gctgctgcag ttcgtggaga gtatgagaca 3480 
tgtgtacttt actctgatca gttgaaacag agatgtgtgg atcagtttaa agaagggtcc 3540 
ttgggtattg aacatcttgc tgctgttgat agcatctgtg attttgtatc aaaggctatg 3600 
ggggctgctg atcctgtccg cacttaccat gtgaatctgt caattttcac ttcaattcct 3660 
gatttttggg cctttggtca attgtttccg attgttccaa ttcaccgctt agatgaaaag 3720 
cctgcagtga ggggaatatt atcggactta acttgtgaca gtgatgggaa ggttgataag 3780 
ttcattggtg gcgaatcaag cttgccgcta catgaattgg gaagtaatgg cgatggtggt 3840 
ggttattatc tggggatgtt tttgggtggg gcttatgagg aggcgctcgg aggactccac 3900 
aacctgtttg gtggaccaag tgtcgtgcgc gtggtgcaga gcgatagcgc tcacagcttt 3960 
gccatgactc gctccgtccc tggcccgtct tgcgctgatg tgctccgagc gatgcagcac 4020 
gagcccgagc tcatgttcga gactctcaag caccgtgcgg aggaattctt ggaacaagaa 4080 
gatgacaaag ggctggctgt tgaatctttg gccagcagcg tagctcagtc cttccataac 4140 
atgccttacc ttgtggcgcc ttcatcttgc cgcttcactg ctgctactga taacaatggt 4200 
ggctataatt actattacag tgatgagaat gcagcagatt ctgctacagg ggaggatgag 4260 
atttggtcct attgcactgc ttgaagtgtt ctcgtagcat ctccagtctt agtttgtcgt 4320 
cgaggttgtc tgtttttgaa taataccctt agttggtgat gtttttct 4368 

<210> 22 
<211> 721 
<212> PRT 
<213> Plant 

<400> 22 

Met Pro Ala Leu Gly Cys Cys Val Asp Ala Ala Val Val Ser Pro Pro 
1 5 10 15 

Leu Ser Tyr Ala Phe Ser Arg Asp Ser Ser Leu Pro Ala Pro Glu Phe 
20 25 30 

Phe Ala Ser Gly Val Pro Pro Thr Asn Ser Ala Ala Ala Ser Ile Gly 
35 40 45 

Ser Pro Asp Leu Ser Ser Ala Leu Tyr Gly Val Asp Gly Trp Gly Ala 
50 55 60 

Pro Tyr Phe Ser Val Asn Ser Asn Gly Asp Ile Ser Val Arg Pro His 
65 70 75 80 

Gly Thr Asp Thr Leu Pro His Gln Glu Ile Asp Leu Leu Lys Val Val 
85 90 95 

Lys Lys Ala Ser Asp Pro Lys Asn Ser Gly Gly Leu Gly Leu Gln Leu 
100 105 110 

Pro Leu Val Val Arg Phe Pro Asp Val Leu Lys Asn Arg Leu Glu Ser 
115 120 125 

Leu Gln Ser Ala Phe Asp Leu Ala Val His Ser Gln Gly Tyr Gly Ala 
130 135 140 

His Tyr Gln Gly Val Tyr Pro Val Lys Cys Asn Gln Asp Arg Phe Val 
145 150 155 160 

Val Glu Asp Ile Val Lys Phe Gly Ser Pro Phe Arg Phe Gly Leu Glu 
165 170 175 

Ala Gly Ser Lys Pro Glu Leu Leu Leu Ala Met Ser Cys Leu Cys Lys 
180 185 190 

Gly Ser Ala Glu Gly Leu Leu Val Cys Asn Gly Phe Lys Asp Ala Glu 
195 200 205 

Tyr Ile Ser Leu Ala Leu Val Ala Arg Lys Leu Met Leu Asn Thr Val 
210 215 220 

Ile Val Leu Glu Gln Glu Glu Glu Leu Asp Leu Val Ile Asp Ile Ser 
225 230 235 240 

His Lys Met Ala Val Arg Pro Val Ile Gly Leu Arg Ala Lys Leu Arg 
245 250 255 

Thr Lys His Ser Gly His Phe Gly Ser Thr Ser Gly Glu Lys Gly Lys 
260 265 270 

Phe Gly Leu Thr Thr Thr Gln Ile Val Arg Val Val Lys Lys Leu Glu 
275 280 285 

Glu Ser Gly Met Leu Asp Cys Leu Gln Leu Leu His Phe His Ile Gly 
290 295 300 

Ser Gln Ile Pro Ser Thr Gly Leu Leu Ala Asp Gly Val Gly Glu Ala 
305 310 315 320 

Ala Gln Ile Tyr Cys Glu Leu Val Arg Leu Gly Ala Gly Met Lys Phe 
325 330 335 

Ile Asp Ile Gly Gly Gly Leu Gly Ile Asp Tyr Asp Gly Thr Lys Ser 
340 345 350 

Cys Asp Ser Asp Val Ser Val Gly Tyr Gly Ile Gln Glu Tyr Ala Ser 
355 360 365 

Ala Val Val Gln Ala Val Gln Tyr Val Cys Asp Arg Lys Gly Val Lys 
370 375 380 

His Pro Val Ile Cys Ser Glu Ser Gly Arg Ala Ile Val Ser His His 
385 390 395 400 

Ser Ile Leu Ile Phe Glu Ala Val Ser Ala Ser Ser His Ser Cys Ser 
405 410 415 

Ser Ser His Leu Ser Ser Gly Gly Leu Gln Ser Met Ala Glu Thr Leu 
420 425 430 



Asn Glu Asp Ala Leu Ala Asp Tyr Arg Asn Leu Ser Ala Ala Ala Val 
435 440 445 

Arg Gly Glu Tyr Glu Thr Cys Val Leu Tyr Ser Asp Gln Leu Lys Gln 
450 455 460 

Arg Cys Val Asp Gln Phe Lys Glu Gly Ser Leu Gly Ile Glu His Leu 
465 470 475 480 

Ala Ala Val Asp Ser Ile Cys Asp Phe Val Ser Lys Ala Met Gly Ala 
485 490 495 

Ala Asp Pro Val Arg Thr Tyr His Val Asn Leu Ser Ile Phe Thr Ser 
500 505 510 

Ile Pro Asp Phe Trp Ala Phe Gly Gln Leu Phe Pro Ile Val Pro Ile 
515 520 525 

His Arg Leu Asp Glu Lys Pro Ala Val Arg Gly Ile Leu Ser Asp Leu 
530 535 540 

Thr Cys Asp Ser Asp Gly Lys Val Asp Lys Phe Ile Gly Gly Glu Ser 
545 550 555 560 

Ser Leu Pro Leu His Glu Leu Gly Ser Asn Gly Asp Gly Gly Gly Tyr 
565 570 575 

Tyr Leu Gly Met Phe Leu Gly Gly Ala Tyr Glu Glu Ala Leu Gly Gly 
580 585 590 

Leu His Asn Leu Phe Gly Gly Pro Ser Val Val Arg Val Val Gln Ser 
595 600 605 

Asp Ser Ala His Ser Phe Ala Met Thr Arg Ser Val Pro Gly Pro Ser 
610 615 620 

Cys Ala Asp Val Leu Arg Ala Met Gln His Glu Pro Glu Leu Met Phe 
625 630 635 640 

Glu Thr Leu Lys His Arg Ala Glu Glu Phe Leu Glu Gln Glu Asp Asp 
645 650 655 

Lys Gly Leu Ala Val Glu Ser Leu Ala Ser Ser Val Ala Gln Ser Phe 
660 665 670 

His Asn Met Pro Tyr Leu Val Ala Pro Ser Ser Cys Arg Phe Thr Ala 
675 680 685 

Ala Thr Asp Asn Asn Gly Gly Tyr Asn Tyr Tyr Tyr Ser Asp Glu Asn 
690 695 700 

Ala Ala Asp Ser Ala Thr Gly Glu Asp Glu Ile Trp Ser Tyr Cys Thr 
705 710 715 720 

Ala 
<210> 23 
<211> 2695 
<212> DNA 
<213> Plant 



<400> 23 

ttcacgttct cttctcaatt cccataaaag aaacccttcc gttaggtttc cgtcctattt 60 
tctcttcttc tacgcttcct cttctgatat caatatctgt atggtgtttt tcttgttcga 120 
attttagatt tgttttgcct ttaatacctg taaccttata attctctgtt taaaccaaaa 180 
acttagcttc ttctgaagtc agggtgggga tatttggatc gtgtaagagt gtgttagaag 240 
gtgattatct tttgattcag ttcctttttt gcttcttttg agggggtagc cggggcctcg 300 
gcctcggcgg gttttaatag cccccatcta ttacaaccat tgggcaaaaa catcattaaa 360 
tctgtacaaa gcaaaccctt aatttagttt aattttctgt attctttgat tctttaacag 420 
aagaagaaga gatgccggcc ctaggttgtt gcgtagacgc tactgtttcc cctcctctcg 480 
gctatgcctt ctctcgggat agctctcttc ccgcgccgga gttctttacc tccggcgtac 540 
ctcctacaaa ctccgccgcc ggttccattg ggtctccgga tctgtcctct gctttgtacg 600 
gggtcgatgg gtggggagct ccttatttct ccgttaactc taacggagat atctccgtcc 660 
gaccacatgg tacggacaca ctcccccacc aggaaattga ccttctcaag gtcgtgaaaa 720 
aggcctccga cccgaaaaat tcaggggggc tcgggcttca gctgcctctt gttgttcgct 780 
tccctgatgt gctaaaaaac cggttggaat ctctgcaatc ggcttttgat ctcgctgttc 840 
attcccaggg ctatggggcc cactaccaag gtgtttatcc cgtgaaatgc aatcaagaca 900 
ggttcgtggt ggaagatatt gtcaaattcg ggtcgtcatt ccggttcggg ttggaagctg 960 
ggtctaaacc cgagctcctg ttagccatga gctgtctctg caggggcagt gctgagggcc 1020 
ttctcgtttg caatggtttc aaggacgctg agtacatttc gcttgctttg gttgcaagaa 1080 
agctcatgtt aaacactgta attgttcttg aacaagagga ggagcttgac cttgtgattg 1140 
atataagccg taagatggct gttcggcccg taattggact tcgggctaag ctcaggacca 1200 
agcattcagg ccattttgga tccacttctg gagaaaaagg taagtttggg cttacaacga 1260 
cccaaattgt tcgtgtagtg aagaagctgg aagaatccgg aatgctggat tgccttcagt 1320 
tgctgcattt tcacattgga tctcagatcc cttcaacggc gttgcttgct gatggtgttg 1380 
gtgaggctgc tcagatttat tgtgaattaa tccgtcttgg tgcgggtatg aagttcattg 1440 
atactggagg tgggctcgga attgattatg atggtactaa atcatgtgat tcagatgtct 1500 
ctgttggcta tggcattcaa gaatacgcct ccacagttgt ccaggcggtt caatatgttt 1560 
gcgaccgtaa gggcgtgaag cacccagtga tttgcagcga aagtggcagg gcaattgttt 1620 
ctcatcactc aattctgatt ttcgaagccg tgtctgcttc tagtcactca tgttcttctt 1680 
cacatctgtc ttctggtggc ctccaatcca tggcggagac gctcaatgaa gatgcccttg 1740 
ctgattaccg caatttatct gctgctgcag ttcgtggaga gtacgagacg tgtgtacttt 1800 
actctgatca gttgaaacag agatgtgtgg atcagtttaa agaagggtcc ttgggtattg 1860 
aacatcttgc tgctgttgat agcatctgtg attttgtatc aaaggctatg ggggctgctg 1920 
atcctatccg cacttaccat gtgaatctgt caattttcac ttcaattcct gatttttggg 1980 
cctttggtca attgtttccg attgttccaa tacaccgttt agatgaaaag cctgcagtaa 2040 
ggggaatatt atcggacttg acttgtgaca gtgatgggaa ggttgataag ttcattggtg 2100 
gcgaatcaag cttgcagctg catgaattgg gaagtaatgg cgatggtggt gggtattatc 2160 
tggggatgtt tttgggtggg gcttatgagg aggcgctcgg aggactccac aacctgtttg 2220 
gtggaccaag cgtggtgcgc gtggtgcaga gcgatagcgc tcacagcttc gccatgtctc 2280 
gctccgtccc tggcccgtcc tgcgcggacg tgctccgagc gatgcagcac gagcccgagc 2340 
tcatgttcga gactctcaag caccgtgcgg aggaattctt ggaacaagaa gaagacaaag 2400 
ggctggccat tgcatctttg gccagcagct tagctcagtc cttccataac atgccttacc 2460 
ttgtggcgcc tgcatcttgc tgcttcactg cagttactgc taacaacggt ggctataact 2520 
actattacag tgatgagaat gcagcagatt ctgctacagg ggaggatgag atttggtcct 2580 
attgcactgc ttgaagtgtt gtcgtagcat ctccagtttt agtttgtcgt cgaagttgtc 2640 
tgtttttgaa taataccctt agttggtgat gtttttctaa aaaaaaaaaa aaaaa 2695 

<210> 24 
<211> 720 
<212> PRT 
<213> Plant 

<400> 24 

Met Pro Ala Leu Gly Cys Cys Val Asp Ala Thr Val Ser Pro Pro Leu 
1 5 10 15 

Gly Tyr Ala Phe Ser Arg Asp Ser Ser Leu Pro Ala Pro Glu Phe Phe 
20 25 30 

Thr Ser Gly Val Pro Pro Thr Asn Ser Ala Ala Gly Ser Ile Gly Ser 
35 40 45 

Pro Asp Leu Ser Ser Ala Leu Tyr Gly Val Asp Gly Trp Gly Ala Pro 
50 55 60 

Tyr Phe Ser Val Asn Ser Asn Gly Asp Ile Ser Val Arg Pro His Gly 
65 70 75 80 

Thr Asp Thr Leu Pro His Gln Glu Ile Asp Leu Leu Lys Val Val Lys 
85 90 95 

Lys Ala Ser Asp Pro Lys Asn Ser Gly Gly Leu Gly Leu Gln Leu Pro 
100 105 110 

Leu Val Val Arg Phe Pro Asp Val Leu Lys Asn Arg Leu Glu Ser Leu 
115 120 125 

Gln Ser Ala Phe Asp Leu Ala Val His Ser Gln Gly Tyr Gly Ala His 
130 135 140 

Tyr Gln Gly Val Tyr Pro Val Lys Cys Asn Gln Asp Arg Phe Val Val 
145 150 155 160 

Glu Asp Ile Val Lys Phe Gly Ser Ser Phe Arg Phe Gly Leu Glu Ala 
165 170 175 

Gly Ser Lys Pro Glu Leu Leu Leu Ala Met Ser Cys Leu Cys Arg Gly 
180 185 190 

Ser Ala Glu Gly Leu Leu Val Cys Asn Gly Phe Lys Asp Ala Glu Tyr 
195 200 205 

Ile Ser Leu Ala Leu Val Ala Arg Lys Leu Met Leu Asn Thr Val Ile 
210 215 220 

Val Leu Glu Gln Glu Glu Glu Leu Asp Leu Val Ile Asp Ile Ser Arg 
225 230 235 240 

Lys Met Ala Val Arg Pro Val Ile Gly Leu Arg Ala Lys Leu Arg Thr 
245 250 255 

Lys His Ser Gly His Phe Gly Ser Thr Ser Gly Glu Lys Gly Lys Phe 
260 265 270 

Gly Leu Thr Thr Thr Gln Ile Val Arg Val Val Lys Lys Leu Glu Glu 
275 280 285 

Ser Gly Met Leu Asp Cys Leu Gln Leu Leu His Phe His Ile Gly Ser 
290 295 300 

Gln Ile Pro Ser Thr Ala Leu Leu Ala Asp Gly Val Gly Glu Ala Ala 
305 310 315 320 

Gln Ile Tyr Cys Glu Leu Ile Arg Leu Gly Ala Gly Met Lys Phe Ile 
325 330 335 

Asp Thr Gly Gly Gly Leu Gly Ile Asp Tyr Asp Gly Thr Lys Ser Cys 
340 345 350 

Asp Ser Asp Val Ser Val Gly Tyr Gly Ile Gln Glu Tyr Ala Ser Thr 
355 360 365 

Val Val Gln Ala Val Gln Tyr Val Cys Asp Arg Lys Gly Val Lys His 
370 375 380 

Pro Val Ile Cys Ser Glu Ser Gly Arg Ala Ile Val Ser His His Ser 
385 390 395 400 

Ile Leu Ile Phe Glu Ala Val Ser Ala Ser Ser His Ser Cys Ser Ser 
405 410 415 

Ser His Leu Ser Ser Gly Gly Leu Gln Ser Met Ala Glu Thr Leu Asn 
420 425 430 

Glu Asp Ala Leu Ala Asp Tyr Arg Asn Leu Ser Ala Ala Ala Val Arg 
435 440 445 

Gly Glu Tyr Glu Thr Cys Val Leu Tyr Ser Asp Gln Leu Lys Gln Arg 
450 455 460 

Cys Val Asp Gln Phe Lys Glu Gly Ser Leu Gly Ile Glu His Leu Ala 
465 470 475 480 

Ala Val Asp Ser Ile Cys Asp Phe Val Ser Lys Ala Met Gly Ala Ala 
485 490 495 

Asp Pro Ile Arg Thr Tyr His Val Asn Leu Ser Ile Phe Thr Ser Ile 
500 505 510 

Pro Asp Phe Trp Ala Phe Gly Gln Leu Phe Pro Ile Val Pro Ile His 
515 520 525 

Arg Leu Asp Glu Lys Pro Ala Val Arg Gly Ile Leu Ser Asp Leu Thr 
530 535 540 

Cys Asp Ser Asp Gly Lys Val Asp Lys Phe Ile Gly Gly Glu Ser Ser 
545 550 555 560 

Leu Gln Leu His Glu Leu Gly Ser Asn Gly Asp Gly Gly Gly Tyr Tyr 
565 570 575 

Leu Gly Met Phe Leu Gly Gly Ala Tyr Glu Glu Ala Leu Gly Gly Leu 
580 585 590 

His Asn Leu Phe Gly Gly Pro Ser Val Val Arg Val Val Gln Ser Asp 
595 600 605 

Ser Ala His Ser Phe Ala Met Ser Arg Ser Val Pro Gly Pro Ser Cys 
610 615 620 

Ala Asp Val Leu Arg Ala Met Gln His Glu Pro Glu Leu Met Phe Glu 
625 630 635 640 

Thr Leu Lys His Arg Ala Glu Glu Phe Leu Glu Gln Glu Glu Asp Lys 
645 650 655 

Gly Leu Ala Ile Ala Ser Leu Ala Ser Ser Leu Ala Gln Ser Phe His 
660 665 670 

Asn Met Pro Tyr Leu Val Ala Pro Ala Ser Cys Cys Phe Thr Ala Val 
675 680 685 

Thr Ala Asn Asn Gly Gly Tyr Asn Tyr Tyr Tyr Ser Asp Glu Asn Ala 
690 695 700 

Ala Asp Ser Ala Thr Gly Glu Asp Glu Ile Trp Ser Tyr Cys Thr Ala 
705 710 715 720 

<210> 25 
<211> 914 
<212> DNA 
<213> Plant 

<400> 25 

aagctcgann ttaancctca ntaaagggaa caaaagctgg taccgnggcc ccccctcgag 60 
gtcgacggta tcgataagct tgattaagct tagtangcac attagcagcg cttgggatga 120 
ttttaggcgc ggcctattcc ctttggctat ataatcgtgt ggttctggga attaaaaccc 180 
gatttcctcc ataaattctc cgatctaaat ggcanagaat tttcatattt ataccttttc 240 
ttgttggagt tgtttggatg ggtgtttacc ccaaagtgtt cctgggactg catgcataca 300 
tccgtaagta acttaagtgc aacatggaaa atttcattga gaggaatcag caaaaaaaaa 360 
aagcttaatc gaattcctgc agcccggggg atccactagt tctagagcgg ccgccaccgc 420 
ggtggagctc caattcgccc tatagtgagt cgtattacaa ttcactgggc cgtcgtttta 480 
caacgtcgtg actgggaaan gcctggcgtt accaacttaa tcgccttgca gcacatcccc 540 
ctttcgccag ctgggcgtaa tagcgaanag gccgcacgat cgccttccca acagttgcgc 600 
acctgatggn gaatgggacn gccctgtanc ngcgcangaa ncgcggcngg tgtggtggta 660 
ccccancgtg acgcnaactt gcaacgccta acgccgcnct tcgcttctcc ttcttctngc 720 
aagttcncgg ctttccgtaa gtccaagcgg gggnccttag gttcgattat gttnggaccc 780 
ccccnaaact gttanggtnt gtgatttggc accccaaaac gtttcccttg nctggtcntt 840 
ttaaangcct ntcaacngaa accaccatcg gnacttgtta aggntncctt gcntgnaaaa 900 
nngtaaatta nntn 914 

<210> 26 
<211> 829 
<212> DNA 
<213> Plant 

<400> 26 

agcaagctcg atatcgccct cactaaaggg aacaaaaact ggtaccgggc cccccctcga 60 
ggtcgacggt atcgataagc ttgattaanc tttttttttt tgaactacac aagggaattt 120 
cttctcctna gtaacacatg agaataatta gtgcaataaa ttacaagagg aacattgcag 180 
ttggatttaa gaatctgcgc tggggaattt agcctcaata tttgctacaa ccgtacagat 240 
ttcactgcat tcatgaacga tagtatccgt gacacatcct tttggatgcc gtcctgtcca 300 
catatgccac tactcacatc cactccattg ggtttaagtt gcagaaagag cttcacaaac 360 
attctccggg ttaattcctc ctgccaagag ccacccatgt ttgcttctaa tgcgcggcag 420 
cttaaactga acccagttga atcctttgcc actgccacct tttgcactat ccactagaac 480 
ccaatcaacc agagaagact cctcatcaga aatatagttc aaaaggctcc tcttcatttg 540 
catgaagtac gtatattaca cgtttttccc tgaccaaagc ttaatcgaat tctgcagccc 600 
gggggatcng gnattctaga gcggcgccac gcggtggagc tccaatcgcc taaatgancn 660 
ataaaatcac tggccgtcgt ttanacgncn ggacgggaaa cctgggtacc aacttaatcg 720 
cctgnagcna tccccttcnc agcggngtan acgaaaggcc gncgattgcc tccanattgc 780 
cacnggatgg aanggacncc gtncgganga acngggggnn ggggtaccn 829 






INTERNATIONAL SEARCH REPORT 


International application No. 
PCT/US00/12450 



A. CLASSIFICATION OF SUBJECT MATTER 
IPC(7) : A01H 5/00; C07H 21/04; C12N 5/14, 15/29, 15/52, 15/82 
US CL : 435/320.1, 414, 419; 536/23.2, 23.6, 24.5; 800/278, 317.3 



According to International Patent Classification (IPC) or to both national classification and IPC 
B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 435/320.1, 414, 419; 536/23.2, 23.6, 24.5; 800/278, 317.3 

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
Please See Extra Sheet 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category*   Citation of document, with indication, where appropriate, of the relevant passages   Relevant to claim No. 

X           HASHIMOTO et al. Intraspecific Variability of the Tandem Repeats                     12 
Y           in Nicotiana Putrescine N-methyltransferase. Plant Molecular                         15,16 
            Biology. 1998, Vol. 37, pages 25-37, especially Figure 3. 

X           HIBI et al. Gene Expression in Tobacco Low-Nicotine Mutants. The                     12 
Y           Plant Cell. May 1994, Vol. 6, pages 723-735, especially Figure 3.                    15,16 

X           IZHAKI et al. A Petunia cDNA Encoding S-Adenosylmethionine                           12 
Y           Synthetase. Plant Physiology. 1995, Vol. 108, pages 841-842, see                     15,16 
            entire article. 

[X] Further documents are listed in the continuation of Box C.    [ ] See patent family annex. 



* Special categories of cited documents: 

"A" document defining the general state of the art which is not considered to be of particular relevance 

"E" earlier document published on or after the international filing date 

"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or other means 

"P" document published prior to the international filing date but later than the priority date claimed 

"T" later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention 

"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art 

"&" document member of the same patent family 



Date of the actual completion of the international search 
17 AUGUST 2000 

Date of mailing of the international search report 

Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCT 
Washington, D.C. 20231 
Facsimile No. (703) 305-3230 

Telephone No. (703) 308-0196 





Form PCT/ISA/210 (second sheet) (July 1998)* 






C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category*   Citation of document, with indication, where appropriate, of the relevant passages   Relevant to claim No. 

            LAMATTINA et al. RNA Editing of the Transcript Coding for                            12 
            Subunit 4 of NADH Dehydrogenase in Wheat Mitochondria:                               15,16 
            Uneven Distribution of the Editing Sites Among the Four Exons. 
            Nucleic Acids Research. 1991, Vol. 19, No. 12, pages 3275-3282, 
            especially Figure 4. 

            LI et al. Arabidopsis Phosphoribosylanthranilate Isomerase:                          12,15 
            Molecular Genetic Analysis of Triplicate Tryptophan Pathway                          16 
            Genes. The Plant Cell. April 1995, Vol. 7, pages 447-461, 
            especially Figure 3, page 459. 



Form PCT/ISA/210 (continuation of second sheet) (July 1998)* 






Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. [ ] Claims Nos.: 
because they relate to subject matter not required to be searched by this Authority, namely: 

2. [ ] Claims Nos.: 
because they relate to parts of the international application that do not comply with the prescribed requirements to such an extent that no meaningful international search can be carried out, specifically: 

3. [ ] Claims Nos.: 
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet 

1. [ ] As all required additional search fees were timely paid by the applicant, this international search report covers all searchable claims. 

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment of any additional fee. 

3. [X] As only some of the required additional search fees were timely paid by the applicant, this international search report covers only those claims for which fees were paid, specifically claims Nos.: 
1-16, 18-20 

4. [ ] No required additional search fees were timely paid by the applicant. Consequently, this international search report is restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

Remark on Protest 
The additional search fees were accompanied by the applicant's protest. 
[ ] No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet(l)) (July 1998)* 






B. FIELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 
STN, AGRICOLA, CAPLUS, BIOSIS, EMBASE, USPAT 

search terms: putrescine methyltransferase, adenosylmethionine synthetase, ornithine decarboxylase, arginine 
decarboxylase, NADH dehydrogenase, phosphoribosylanthranilate isomerase, DNA, cDNA, gene, nucleic 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This application contains the following inventions or groups of inventions which are not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search fees 
must be paid. 

Group I, claim(s) 1-16, drawn to coding DNA, vector, host cell, transgenic plant. 
Group II, claim(s) 17, drawn to protein. 

Group III, claim(s) 18-20, drawn to transformation method and transgenic plant with promoter DNA. 



The inventions listed as Groups I, II, and III do not relate to a single inventive concept under PCT Rule 13.1 because, 
under PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: 

The coding DNA of Group I, e.g. Claim 12, is disclosed in the prior art publication of Hashimoto et al. (Plant 
Mol. Biol. 37: 25-37, 1998; see Fig. 3b). Therefore, there is no special technical feature which links the coding DNA of 
Group I with the protein of Group II. 

Furthermore, there is no special technical feature under PCT Rule 13.2 which links the coding DNA of Group I 
and the transformation method and transgenic plant with the promoter DNA of Group III. Therefore, the inventions of 
Groups I, II, and III do not relate to a single inventive concept under PCT Rule 13.1. 



Form PCT/ISA/210 (extra sheet) (July 1998)* 



